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(37 CFR 1.9(f) AND 1.27(c) - SMALL BUSINESS CONCERN 
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ADDRESS OF CONCERN Greenburn Road, Bucksburn, Aberdeen, AB21 9SB. United Kingdom 
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ASST. COMMISSIONER FOR PATENTS 
Box PCT 

WASHINGTON, DC 2 0231 
Dear Sir: 

PRELIMINARY AMENDMENT 

Prior to computation of the filing fee and prior to 
examination of the application, please amend certain of the 
claims as set forth in Attachment 1. 

By entry of this preliminary amendment, a prompt and 
thorough examination of this application on the merits is 
olici ted . 

Respectfully submitted, 



Charles N. Quinn 
Reg. No. 27,223 
Attorney for Applicant 
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Express Mail #EL261 100966 US 



Certificate of Express Mailing Under 37 CFR 1.10 



I hereby certify that this PRELIMINARY AMENDMENT is being 
deposited with the United States Postal Service as Express Mail 
in an envelope, bearing U.S. Express Mail label number 
#EL261100966US and the required postage, addressed to Assistant 
Commissioner for Patents, Box PCT, Washington, D.C. 20231, on the 
date appearing below: 



By:. 




Charles N. Quinn 
March 2, 2000 
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ATTACHMENT 1 



4. (Amended) A peptide display carrier package (PDCP) as 
claimed in [any one of Claims] claim 1 [to 3], wherein said 
target peptide portion is displayed externally on the 
package . 

5. (Amended) A peptide display carrier package (PDCP) as 
claimed in [any one of Claims] claim 1 [to 4] wherein said 
recombinant polynucleotide includes a linker sequence 
between the nucleotide sequence encoding the nucleotide 
binding portion and the nucleotide sequence encoding the 
target peptide portion. 

6. (Amended) A peptide display carrier package (PDCP) as 
claimed in [any one of Claims] claim 1 [to 5] wherein said 
recombinant polynucleotide has two or more nucleotide 
sequence motifs each of which can be bound by the 
nucleotide binding portion of the chimeric protein. 

7. (Amended) A peptide display carrier package (PDCP) as 
claimed in [any one of Claims] claim 1 [to 6] wherein said 
nucleotide binding portion is a DNA binding domain of an 
oestrogen or progesterone receptor. 

8. (Amended) A peptide display carrier package (PDCP) as 
claimed in [any one of Claims] claim 1 [to 7] wherein said 
recombinant polynucleotide is bound to said chimeric 
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protein as single stranded DNA. 



9. (Amended) A peptide display carrier package (PDCP) as 
claimed in [any one of Claims] claim 1 [to 8] wherein said 
target peptide portion is located at the N and/or C 
terminal of the chimeric protein. 

10. (Amended) A peptide display carrier package (PDCP) as 
claimed in [any one of Claims] claim 1 [to 9] which is 
produced in a host cell transformed with said recombinant 
polynucleotide and extruded therefrom without lysis of the 
host cell. 

13 . (Amended) A recombinant polynucleotide as claimed in 
[either one of Claims] claim 11 [and 12] which includes a 
linker sequence between the nucleotide sequence encoding 
the nucleotide binding portion and the nucleotide sequence 
encoding the target peptide portion. 

14. (Amended) A recombinant polynucleotide as claimed in 
[either one of Claims] claim 11 [to 13] which has two or 
more nucleotide sequence motifs each of which can be bound 
by the nucleotide binding portion of the chimeric protein. 

15 . (Amended) A recombinant polynucleotide as claimed in 
[either one of Claims] claim 11 [to 14] wherein said 
nucleotide binding portion is a DNA binding domain of an 
estrogen or progesterone receptor. 
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16. (Amended) , A recombinant polynucleotide as claimed in 
[either one of Claims] claim 11 [to 15] wherein said 
recombinant polynucleotide is bound to said chimeric 
protein as single stranded DNA. 

19 . (Amended) A genetic construct or set of genetic constructs 
as claimed in [either one of Claims] claim 17 [and 18] 
which includes a vector pDM12 or pDMl4 or pDMl6, deposited 
at NCIMB under Nos 40970, 40971 and 40972 respectively. 



C. \DATA\MURGIT\1015-00 . acl March 2 , 2000 T-.jmn 



WO 99/11785 




1 

2 
3 
4 
5 
6 
7 
S 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



Chimeric binding peptide library screening method 

The present invention relates generally to methods for 
screening nucleotide libraries for sequences that 
encode peptides of interest. 

Isolating an unknown gene which encodes a desired 
peptide from a recombinant DNA library can be a 
difficult task. The use of hybridisation probes may 
facilitate the process, but their use is generally 
dependent on knowing at least a portion of the sequence 
of the gene which encodes the protein. When the 
sequence is not known, DNA libraries can be expressed 
in an expression vector, and antibodies have been used 
to screen for plaques or colonies displaying the 
desired protein antigen. This procedure has been useful 
in screening small libraries, but rarely occurring 
sequences which are represented in less than about 1 in 
10 5 clones (as is the case with rarely occurring cDNA 
molecules or synthetic peptides) can be easily missed, 
making screening libraries larger than 10 6 clones at 
best laborious and difficult. Methods designed to 
address the isolation of rarely occurring sequences by 
screening libraries of 10 6 clones have been developed 
and include phage display methods and LacI fusion phage 
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1 display, discussed in more detail below. 
2 

3 Phage display methods . Members of DNA libraries which 

4 are fused to the N-terminal end of filamentous 

5 bacteriophage pill and pVIII coat proteins have been 

6 expressed from an expression vector resulting in the 

7 display of foreign peptides on the surface of the phage 

8 particle with the DNA encoding the fusion protein 

9 packaged in the phage particle (Smith G. P., 1985, 

10 Science 228: 1315-1317) . The expression vector can be 

11 the bacteriophage genome itself, or a phagemid vector, 

12 into which a bacteriophage coat protein has been 

13 cloned. In the latter case, the host bacterium, 

14 containing the phagemid vector, must be co-infected 

15 with autonomously replicating bacteriophage, termed 

16 helper phage, to provide the full complement of 

17 proteins necessary to produce mature phage particles. 

18 The helper phage normally has a genetic defect in the 

19 origin of replication which results in the preferential 
2 0 packaging of the phagemid genome. Expression of the 

21 fusion protein following helper phage infection, allows 

22 incorporation of both fusion protein and wild type coat 

23 protein into the phage particle during assembly. 

24 Libraries of fusion proteins incorporated into phage, ■ 

25 can then be selected for binding members against 

2 6 targets of interest (ligands) . Bound phage can then be 

27 allowed to reinfect Escherichia, coll (E . coll) bacteria 

28 and then amplified and the selection repeated, 

29 resulting in the enrichment of binding members 

30 (Parmley, S. F. , & Smith, G. P. 1988., Gene 73: 305- 

31 318; Barrett R. w. et al . , 1992, Analytical 

32 Biochemistry 204: 357-3 64 Williamson et al., Proc. 

33 Natl. Acad. Sci. USA, 90: 4141-4145; Marks et al . , 

34 1991, J. Mol. Biol. 222: 581-597). 
35 

3 6 Several publications describe this method. For example, 
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1 US Patent No 5,403,484 describes production of a 

2 chimeric protein formed from the viral coat protein and 

3 the peptide of interest. In this method at least a 

4 functional portion of a viral coat protein is required 

5 to cause display of the chimeric protein or a processed 

6 form thereof on the outer surface of the virus- In 

7 addition, US Patent No 5,571,698 describes a method for 

8 obtaining a nucleic acid encoding a binding protein, a 

9 key component of which comprises preparing a population 

10 of amplifiable genetic packages which have a 

11 genetically determined outer surface protein, to cause 

12 the display of the potential binding domain on the 

13 outer surface of the genetic package. The genetic 

14 packages are selected from the group consisting of 

15 cells, spores and viruses. For example when the 

16 genetic package is a bacterial cell, the outer surface 

17 transport signal is derived from a bacterial outer 

18 surface protein, and when the genetic package is a 

19 filamentous bacteriophage, the outer surface transport 
2 0 signal is provided by the gene pill (minor coat 

21 protein) or pVIII (major coat protein) of the 

22 filamentous phage. 
23 

2 4 WO-A-92/0104 7 and WO-A-9 2 / 2 07 9 1 describe methods for 

25 producing multimeric specific binding pairs, by 

2 6 expressing a first polypeptide chain fused to a viral 

27 coat protein, such as the gene pill protein, of a 

28 secreted replicable genetic display package (RGDP) 

29 which displays a polypeptide at the surface of the 

3 0 package, and expressing a second polypeptide chain of 
31 the multimer, and allowing the two chains to come 

3 2 together as part of the RGDP. 
33 

3 4 Lad fusion plasmid display . This method is based on 

3 5 the DNA binding ability of the lac repressor. Libraries 

3 6 of random peptides are fused to the lacl repressor 
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1 protein, normally to the C-terminal end, through 

2 expression from a plasmid vector carrying the fusion 

3 gene. Linkage of the Lacl-peptide fusion to its 

4 encoding DNA occurs via the lacO sequences on the 

5 plasmid, forming a stable peptide-LacI-peptide complex. 

6 These complexes are released from their host bacteria 

7 by cell lysis, and peptides of interest isolated by 

8 affinity purification on an immobilised target. The 

9 plasmids thus isolated can then be reintroduced into E. 

10 coli by electroporation to amplify the selected 

11 population for additional rounds of screening (Cull, M. 

12 G. et al. 1992. Proc. Natl. Acad. Sci. U.S.A. 89:1865- 

13 1869). 
14 

15 US Patent No 549853 0 describes a method for 

16 constructing a library of random peptides fused to a 

17 DNA binding protein in appropriate host cells and 

18 culturing the host cells under conditions suitable for 

19 expression of the fusion proteins intra-cellularly , in 

20 the cytoplasm of the host cells. This method also 

21 teaches that the random peptide is located at the 

22 carboxy terminus of the fusion protein and -that the 
2 3 fusion protein-DNA complex is released from the host 
24 cell by cell lysis. No method is described for the 

2 5 protection of the DNA from degradation once released 

26 from the lysed cell. Several DNA binding proteins are 

27 claimed but no examples are shown except lacl. 
28 

29 There remains a need for methods of constructing 

3 0 peptide libraries in addition to the methods described 

31 above. For instance, the above methods do not permit 

32 production of secreted peptides with a free carboxy 
3 3 terminus. The present invention describes an 

34 alternative method for isolating peptides of interest 

3 5 from libraries and has significant advantages over the 

3 6 prior art methods. 
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1 In general terms, the present invention provides a 

2 method for screening a nucleotide library (usually a 

3 DNA library) for a nucleotide sequence which encodes a 

4 target peptide of interest. The method involves 

5 physically linking each peptide to a polynucleotide 

6 including the specific nucleotide sequence encoding 

7 that peptide. Linkage of a peptide to its encoding 

8 nucleotide sequence is achieved via linkage of the 

9 peptide to a nucleotide binding domain. A bifunctional 

10 chimeric protein with a nucleotide binding domain and a 

11 library member or target peptide (preferably with a 

12 function of interest) is thus obtained. The peptide of 

13 interest is bound to the polynucleotide encoding that 

14 peptide via the nucleotide binding domain of the 

15 chimeric protein. 
16 

17 The polynucleotide-chimeric protein complex is then 

18 incorporated within a peptide display carrier package 

19 (PDCP) , protecting the polynucleotide from subsequent 

20 degradation, while displaying the target peptide 

21 portion on the outer surface of the peptide display 

22 carrier package (PDCP) . 
23 

24 Thus, in one aspect, the present invention provides a 

25 peptide display carrier package (PDCP) , said package 

26 comprising a polynucleotide-chimeric protein complex 

27 wherein the chimeric protein has a nucleotide binding 

28 portion and a target peptide portion, wherein said 

29 polynucleotide comprises a nucleotide sequence motif 

30 which is specifically bound by said nucleotide binding 

31 portion, and wherein at least the chimeric protein 

32 encoding portion of the polynucleotide not bound by the 

33 nucleotide binding portion of the chimeric protein is 

34 protected. 
35 

3 6 In one embodiment the polynucleotide is protected by a 
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1 protein which binds non-specif ically to naked 

2 polynucleotide. Examples include viral coat proteins, 

3 many of which are well-known in the art. Where the 

4 chosen viral coat protein requires an initiation 

5 sequence to commence general binding to the 

6 polynucleotide, this will be provided on the 

7 polynucleotide at appropriate location (s) . A preferred 

8 coat protein is coat protein from a bacteriophage, 

9 especially M13 . 
10 

11 Generally, the nucleic binding portion of the chimeric 

12 protein is selected for its specificity for the 

13 nucleotide sequence motif present in the recombinant 

14 polynucleotide encoding the chimeric protein. 
15 

16 Optionally, the nucleotide sequence motif may be an 

17 integral part of the protein encoding region of the 

18 polynucleotide. Alternatively, and more usually, the 

19 motif may be present in a non-coding region of the 

20 polynucleotide. For the purposes of this invention, 

21 all that is required is for the motif to be located on 

22 the polynucleotide such that the nucleotide binding 

23 portion of the chimeric protein is able to recognise 

24 and bind to it. Desirably the polynucleotide-chimeric 

25 protein complex has a dissociation constant of at least 

26 one hour. 
27 

28 Optionally, the recombinant polynucleotide may comprise 

29 two or more nucleotide sequence motifs, each of which 
3 0 will be bound by a chimeric protein molecule. 

31 Preferably, the motifs are positioned along the length 

3 2 of the polynucleotide to avoid steric hindrance between 

3 3 the bound chimeric proteins. 
34 

35 Preferably, the nucleotide sequence motif is not 

3 6 affected by the presence of additional nucleotide 
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1 sequence (e.g. encoding sequence) at its 5' and/or 3' 

2 ends. Thus the chimeric fusion protein may include a 

3 target peptide portion at its N terminal end, at its c 

4 terminal end or may include two target peptide portions 

5 (which may be the same or different) at each end of the 

6 nucleotide binding portion, ie at both the N and C 

7 terminal ends of the chimeric protein. For example one 

8 target peptide may be an antibody of known specificity 

9 and the other peptide may be a peptide of potential 
10 interest. 

11 

12 Desirably the target peptide portion of the chimeric 

13 protein is displayed externally on the peptide display 

14 carrier package, and is thus available for detection, 

15 reaction and/ or binding. 
16 

17 In more detail the PDCP may be composed two distinct 

18 elements: 

19 a. A polynucleotide-chimeric protein complex. This 

20 links the displayed target peptide portion to the 

21 polynucleotide encoding that peptide portion 

22 through a specific polynucleotide binding portion. 
2 3 The nucleotide sequence encoding the chimeric 

2 4 protein, and the specific nucleotide sequence 

25 motif recognised by the nucleotide binding portion 

2 6 of the chimeric protein must be present on a 

27 segment of polynucleotide which can be 

28 incorporated into the PDCP; and 

29 b. A protective coat. This may be supplied by a 

3 0 replicable carrier or helper package capable of 

31 independent existence. Alternatively, a coat 

32 protein could be encoded by the recombinant 

33 polynucleotide of the invention. The protective 

34 coat for the polynucleotide-chimeric protein 

35 complex may be composed of a biological material 

36 such as protein or lipid, but the protective coat 
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1 is not required for linking the target peptide to 

2 the polynucleotide encoding that peptide. The 

3 protective coat must allow the display of the 

4 target peptide portion of the chimeric protein on 

5 its outer surface. The carrier or helper package 

6 may also provide the mechanism for releasing the 

7 intact PDCP from host cells when so required. By 

8 way of example, when a bacteriophage is the 

9 replicable carrier package, a protein coat of the 
3_0 bacteriophage surrounds the polynucleotide- 

3_! chimeric protein complex to form the PDCP, which 

2_2 is then extruded from the host bacterial cell. 

13 

14 The invention described herein demonstrates that 

15 peptides fused to a nucleotide binding domain can be 

16 displayed externally, even through a bacteriophage 

17 carrier package protein coat, while still bound to the 

18 polynucleotide encoding the displayed peptide. 
19 

2 0 The present invention also provides a recombinant 

21 polynucleotide comprising a nucleotide sequence 

22 encoding a chimeric protein having a nucleotide binding 

23 portion operably linked to a target peptide portion, 

24 wherein said polynucleotide includes a specific 
2 5 nucleotide sequence motif which is bound by the 

2 6 nucleotide binding portion of said chimeric protein and 

27 further encoding a non-sequence-specific nucleotide 

2 8 binding protein. 
29 

30 Desirably, the recombinant polynucleotide is a 

31 recombinant expression system, able to express the 

32 chimeric protein when placed in a suitable environment, 

3 3 for example a compatible host cell. After its 

34 expression, the chimeric protein binds to the specific 

35 nucleotide sequence (motif) present in the 

3 6 polynucleotide comprising the nucleotide sequence 
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1 encoding the chimeric protein. 
2 

3 Optionally there may be a linker sequence located 

4 between the nucleotide sequence encoding the nucleotide 

5 binding portion and the polynucleotide inserted into 

6 the restriction enzyme site of the construct. 
7 

8 Desirably the nucleotide binding portion is a DNA 

9 binding domain of an oestrogen or progesterone 

10 receptor, or a functional equivalent thereof. Examples 

11 of sequences encoding such nucleotide binding portions 

12 are set out in SEQ ID Nos 11 and 13. 
13 

14 The term "expression system" is used herein to refer to 

15 a genetic sequence which includes a protein-encoding 

16 region and is operably linked to all of the genetic 

17 signals necessary to achieve expression of that region. 

18 Optionally, the expression system may also include 

19 regulatory elements, such as a promoter or enhancer to 

20 increase transcription and/or translation of the 

21 protein encoding region or to provide control over 

22 expression. The regulatory elements may be located 

23 upstream or downstream of the protein encoding region 

2 4 or within the protein encoding region itself. Where 

25 two or more distinct protein encoding regions are 

26 present these may use common regulatory element (s) or 

27 have separate regulatory element(s). 
28 

29 Generally, the recombinant polynucleotide described 

30 above will be DNA. Where the expression system is 

31 based upon an M13 vector, usually the polynucleotide 

32 binding portion of the expressed chimeric portion will 

33 be single-stranded DNA. However, other vector systems 

34 may be used and the nucleotide binding portion may be 

35 selected to bind preferentially to double-stranded DNA 

3 6 or to double or single-stranded RNA, as convenient. 
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1 Additionally the present invention provides a vector 

2 containing such a recombinant expression system and 

3 host cells transformed with such a recombinant 

4 expression system (optionally in the form of a vector) . 
5 

6 Whilst the recombinant polynucleotide described above 

7 forms an important part of the present invention, we 

8 are also concerned with the ability to screen large 

9 (e.g. of at least 10 5 members, for example 10 6 or even 

10 10 7 members) libraries of genetic material. One of the 

11 prime considerations therefore is the provision of a 

12 recombinant genetic construct into which each member of 

13 said library can individually be incorporated to form 

14 the recombinant polynucleotide described above and to 

15 express the chimeric protein thereby encoded (the 

16 target peptide of which is encoded by the nucleotide 

17 library member incorporated into the construct) . 
18 

19 Thus viewed in a further aspect the present invention 

2 0 provides a genetic construct or set of genetic 

21 constructs comprising a polynucleotide having a 

22 sequence which includes: 
23 

24 i) a sequence encoding a nucleotide binding portion 
2 5 able to recognise and bind to a specific sequence 

2 6 motif; 

27 ii) the sequence motif recognised and bound by the 

28 nucleotide binding portion encoded by (i) ; 

29 iii) a restriction enzyme site which permits insertion 

3 0 of a polynucleotide, said site being designed to 
31 operably link said polynucleotide to the sequence 
3 2 encoding the nucleotide binding portion so that 

3 3 expression of the operably linked polynucleotide 

34 sequences yields a chimeric protein; and 

3 5 iv) a sequence encoding a nucleotide binding protein 
3 6 which binds non-specif ically to naked 
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1 polynucleotide. 

2 

3 Optionally there may be a linker sequence located 

4 between the nucleotide sequence encoding the nucleotide 

5 binding portion and the sequence of the polynucleotide 

6 from the library inserted into the restriction enzyme 

7 site of the construct. 
8 

9 Desirably the nucleotide binding portion is a DNA 

10 binding domain of an oestrogen or progesterone 

11 receptor, or a functional equivalent thereof. Examples 

12 of sequences encoding such nucleotide binding portions 

13 are set out in SEQ ID Nos 11 and 13. 
14 

15 Suitable genetic constructs according to the invention 

16 include pDM12 , pDM14 and pDM16, deposited at NCIMB on 

17 28 August 1998 under Nos NCIMB 40970, NCIMB 40971 and 

18 NCIMB 4 097 2 respectively. 
19 

20 It is envisaged that a- conventionally produced genetic 

21 library may be exposed to the genetic construct (s) 

22 described above. Thus, each individual member of the 

23 genetic library will be separately incorporated into 

24 the genetic construct and the library will be present 

2 5 in the form of a library of recombinant polynucleotides 

2 6 (as described above) , usually in the form of vectors, 

27 each recombinant polynucleotide including as library 

2 8 member . 
29 

3 0 Thus, in a further aspect, the present invention 

31 provides a library of recombinant polynucleotides (as 

3 2 defined above) wherein each polynucleotide includes a 

3 3 polynucleotide obtained from a genetic library and 

3 4 which encodes the target peptide portion of the 

3 5 chimeric protein expressed by the recombinant 

3 6 polynucleotide. 
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1 optionally, the chimeric protein may further include a 

2 linker sequence located between the nucleotide binding 

3 portion and the target peptide portion. The linker 

4 sequence will reduce steric interference between the 

5 two portions of the protein. Desirably the linker 

6 sequence exhibits a degree of flexibility. 
7 

8 Also disclosed are methods for constructing and 

9 screening libraries of PDCP particles, displaying many 

10 different peptides, allowing the isolation and 

11 identification of particular peptides by means of 

12 affinity techniques relying on the binding activity of 

13 the peptide of interest. The resulting polynucleotide 

14 sequences can therefore be more readily identified, re- 

15 cloned and expressed. 
16 

17 A method of constructing a genetic library, said method 

18 comprising: 
19 

2 0 a) constructing multiple copies of a recombinant 

21 vector comprising a polynucleotide sequence which 

22 encodes a nucleotide binding portion able to 

2 3 recognise and bind to a specific sequence motif 

24 (and optionally also including the specific 

25 sequence motif) ; 
26 

27 b) operably linking each said vector to a 

28 polynucleotide encoding a target polypeptide, such 

29 that expression of said operably linked vector 

3 0 results in expression of a chimeric protein 

31 comprising said target peptide and said nucleotide 

32 binding portions; wherein said multiple copies of 

33 said operably linked vectors collectively express 

34 a library of target peptide portions; 
35 

36 c) transforming host cells with the vectors of step 
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1 b) ; 

2 

3 d) culturing the host cells of step c) under 

4 conditions suitable for expression of said 

5 chimeric protein; 
6 

7 e) providing a recombinant polynucleotide comprising 

8 the nucleotide sequence motif specifically 

9 recognised by the nucleotide binding portion and 

10 exposing this polynucleotide to the chimeric 

11 protein of step d) to yield a polynucleotide- 

12 chimeric protein complex; and 
13 

14 f) causing production of a non-sequence-specific 

15 moiety able to bind to the non-protected portion 

16 of the polynucleotide encoding the chimeric 

17 protein to form a peptide display carrier package. 
18 

19 The present invention further provides a method of 

2 0 screening a genetic library, said method comprising: 
21 

2 2 a) exposing the polynucleotide members of said 

23 library to multiple copies of a genetic construct 

24 comprising a nucleotide sequence encoding a 

25 nucleotide binding portion able to recognise and 

2 6 bind to a specific sequence motif, under 

27 conditions suitable for the polynucleotides of 

28 said library each to be individually ligated into 

29 one copy of said genetic construct, to create a 

3 0 library of recombinant polynucleotides; 
31 

32 b) exposing said recombinant polynucleotides to a 

33 population of host cells, under conditions 

34 suitable for transformation of said host cells by 

35 said recombinant polynucleotides; 
36 
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l c) selecting for transformed host cells; 
2 

3 d) exposing said transformed host cells to conditions 

4 suitable for expression of said recombinant 

5 polynucleotide to yield a chimeric protein; and 
6 

7 e) providing a recombinant polynucleotide comprising 

8 the nucleotide sequence motif specifically 

9 recognised by the nucleotide binding portion and 

10 exposing this polynucleotide to the chimeric 

11 protein of step d) to yield a polynucleotide- 

12 chimeric protein complex; 
3.3 

14 f) protecting any exposed portions of the 

15 polynucleotide in the complex of step e) to form a 

16 peptide display carrier package; and 
17 

18 g) screening said peptide display carrier package to 

19 select only those packages displaying a target 
2 0 peptide portion having the characteristics 

21 required. 
22 

23 Desirably in step a) the genetic construct is pDM12 , 

2 4 pDM14 or pDM16. 

25 

2 6 Desirably in step f) the peptide display package 

2 7 carrier is extruded from the transformed host cell 

2 8 without lysis of the host cell. 
29 

3 0 Generally the transformed host cells will be plated out 
3 1 or otherwise divided into single colonies following 

3 2 transformation and prior to expression of the chimeric 

3 3 protein. 

34 

3 5 The screening step g) described above may look for a 

3 6 particular target peptide either on the basis of 



SUBSTITUTE SHEET (RULE 25) 



WO 99/11785 PCT/GB98/02630 



1 function (e.g. enzymic activity) or structure (e.g. 

2 binding to a specific antibody) . Once the peptide 

3 display carrier package is observed to include a target 

4 peptide with the desired characteristics, the 

5 polynucleotide portion thereof (which of course encodes 

6 the chimeric protein itself) can be amplified, cloned 

7 and otherwise manipulated using standard genetic 

8 engineering techniques. 
9 

10 The current invention differs from the prior art 

11 teaching of the previous disclosures US Patent No 

12 5,403,484 and US Patent No 5,571,698, as the invention 

13 does not require outer surface transport signals, or 

14 functional portions of viral coat proteins, to enable 

15 the display of chimeric binding proteins on the outer 

16 surface of the viral particle or genetic package. 
17 

18 The current invention also differs from the teaching of 

19 WO-A-92/01047 and WO-A-92/20791 , as no component of a 

20 secreted replicable genetic display package, or viral 

21 coat protein is required, to enable display of the 

22 target peptide on the outer surface of the viral 

23 particle. 
24 

2 5 The current invention differs from the teaching of US 

26 Patent No 5498530, as it enables the display of 

27 chimeric proteins, linked to the polynucleotide 

28 encoding the chimeric protein, extra-cellular ly, not in 

29 the cytoplasm of a host cell. In the current invention 

3 0 the chimeric proteins are presented on the outer 

31 surface of a peptide display carrier package (PDCP) 

32 which protects the DNA encoding the chimeric protein, 

33 and does not require cell lysis to obtain access to the 

34 chimeric protein-DNA complex. Finally, the current 
3 5 invention does not rely upon the lad DNA binding 
36 protein to form the chimeric protein-DNA complex. 
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1 In one embodiment of the invention, the nucleotide 

2 binding portion of the chimeric protein comprises a DNA 

3 binding domain from one or more of the nuclear steroid 

4 receptor family of proteins, or a functional equivalent 

5 of such a domain. Particular examples include (but are 

6 not limited to) a DNA binding domain of the oestrogen 

7 receptor or the progesterone receptor, or functional 

8 equivalents thereof. These domains can recognise 

9 specific DNA sequences, termed hormone response 

10 elements (HRE) , which can be bound as both double and 

11 single-stranded DNA. The DNA binding domain of such 

12 nuclear steroid receptor proteins is preferred. 
13 

14 The oestrogen receptor is especially referred to below 

15 by way of example, for convenience since: 

16 (a) The oestrogen receptor is a large multifunctional 

17 polypeptide of 59 5 amino acids which functions in the 

18 cytoplasm and nucleus of eukaryotic cells (Green et 

19 al., 1986, Science 231: 1150-1154). A minimal high 
2 0 affinity DNA binding domain (DBD) has been defined 

21 between amino acids 176 and 282 (Mader et al., 1993, 

22 Nucleic Acids Res. 21: 1125-1132). The functioning of 

2 3 this domain (i.e. DNA binding) is not inhibited by the 

2 4 presence of non-DNA binding domains at both the N and C 

2 5 terminal ends of this domain, in the full length 

2 6 protein. 
27 

2 8 (b) The oestrogen receptor DNA binding domain fragment 
29 (amino acids 176-282) has been expressed in E. coli and 

3 0 shown to bind to the specific double stranded DNA 

31 oestrogen receptor target HRE nucleotide sequence, as a 

32 dimer with a similar af f inity .. ( 0 . 5nM) to the parent 

3 3 molecule (Murdoch et al. 1990, Biochemistry 29: 8377- 

34 8385; Mader et al. , 1993, Nucleic Acids Research 21: 

3 5 112 5-1132) . DBD dimerization on the surface of the PDCP 

36 should result in two peptides displayed per particle. 
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1 This bivalent display can aid in the isolation of low 

2 affinity peptides and peptides that are required to 

3 form a bivalent conformation in order to bind to a 

4 particular target, or activate a target receptor- The 

5 oestrogen receptor is capable of binding to its 3 8 base 

6 pair target HRE sequence, consensus sequence: 
7 

8 1) 5 ' -TCAGGTCAGAGTGACCTGAGCTAAAATAACACATTCAG-3 ' 

9 ("minus strand") SEQ ID No 1, and 
10 

H 2 ) 3 ' -AGTCCAGTCTCACTGGACTCGATTTTATTGTGTAAGTC-5 ' 

12 ("plus strand") SEQ ID No 2, 

13 

14 with high affinity and specificity, under the salt and 

15 pH conditions normally required for selection of 

16 binding peptides. Moreover, binding affinity is 

17 increased 60-fold for the single-stranded coding, or 

18 "plus", strand (i.e. SEQ ID No 2) of the HRE nucleotide 

19 sequence over the double stranded form of the specific 

20 target nucleotide sequence (Peale et al. 1988, Proc. 

21 Natl. Acad. Sci. USA 85: 1038-1042; Lannigan & Notides, 

22 1989, Proc. Natl. Acad. Sci. USA 86: 863-867). 
23 

24 In an embodiment of the invention where the DNA binding 

25 component of the peptide display carrier package is the 

26 oestrogen receptor, the nucleotide (DNA) binding 

27 portion contains a minimum sequence of amino acids 176- 

28 282 of the oestrogen receptor protein. In addition, the 

29 consensus oestrogen receptor target HRE sequence is 

3 0 cloned in such a way that if single stranded DNA can be 

31 produced then the coding, or "plus", strand of the 

3 2 oestrogen receptor HRE nucleotide sequence is 

3 3 incorporated into single-stranded DNA. An example of a 

3 4 vector suitable for this purpose is pUC119 (see Viera 

35 et al., Methods in Enzymology, Vol 153, pages 3-11, 

36 1987). 
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1 In a preferred embodiment of the invention a peptide 

2 display carrier package (PDCP) can be assembled when a 

3 bacterial host cell is transformed with a bacteriophage 

4 vector, which vector comprises a recombinant 

5 polynucleotide as described above. The expression 

6 vector will also comprise the specific nucleotide motif 

7 that can be bound by the nucleotide binding portion of 

8 the chimeric protein. Expression of recombinant 

9 polynucleotide results in the production of the 

10 chimeric protein which comprises the target peptide and 

11 the nucleotide binding portion. The host cell is grown 

12 under conditions suitable for chimeric protein 

13 expression and assembly of the bacteriophage particles, 

14 and the association of the chimeric protein with the 

15 specific nucleotide sequence in the expression vector. 
16 

17 In this embodiment, since the vector is a 

18 bacteriophage, which replicates to produce a single- 

19 stranded DNA, the nucleotide binding portion preferably 
2 0 has an affinity for single-stranded DNA. Incorporation 
21 of the vector single-stranded DNA-chimeric protein 

2 2 complex into bacteriophage particles results in the 

2 3 assembly of the peptide display carrier package (PDCP) , 

2 4 and display of the target peptide on the outer surface 

2 5 of the PDCP. 

26 

2 7 In this embodiment both of the required elements for 

2 8 producing peptide display carrier packages are 

2 9 contained on the same vector. Incorporation of the DNA- 

3 0 chimeric protein complex into a peptide display carrier 
31 package (PDCP) is preferred as DNA degradation is 

3 2 prevented, large numbers of PDCPs are produced per host 

3 3 cell, and the PDCPs are easily separated from the host 

34 cell without recourse to cell lysis. 
35 

3 6 In a more preferred embodiment, the vector of the is a 
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1 phagemid vector (for example pUC119) where expression 

2 of the chimeric protein is controlled by an inducible 

3 promoter. In this embodiment the PDCP can only be 

4 assembled following infection of the host cell with 

5 both phagemid vector and helper phage. The transfected 

6 host cell is then cultivated under conditions suitable 

7 for chimeric protein expression and assembly of the 

8 bacteriophage particles. 
9 

10 In this embodiment the elements of the PDCP are 

11 provided by two separate vectors. The phagemid derived 

12 PDCP is superior to phagemid derived display packages 

13 disclosed in WO-A-92/01047 where a proportion of 

14 packages displaying bacteriophage coat protein fusion 

15 proteins will contain the helper phage DNA, not the 

16 fusion protein DNA sequence. In the current invention, 

17 a PDCP can display the chimeric fusion protein only 

18 when the package contains the specific nucleotide motif 

19 recognised by the nucleotide binding portion. In most 
2 0 embodiments this sequence will be present on the same 

21 DNA segment that encodes the fusion protein. In 

22 addition, the prior art acknowledges that when mutant 

2 3 and wild type proteins are co-expressed in the same 

24 bacterial cell, the wild type protein is produced 

25 preferentially. Thus, when the wild type helper phage, 

26 phage display system of WO-A-92/01047 is used, both 

27 wild type gene pill and target peptide-gene pill 

28 chimeric proteins are produced in the same cell. The 

29 result of this is that the wild type gene pill protein 

3 0 is preferentially packaged into bacteriophage 

31 particles, over the chimeric protein. In the current 

3 2 invention, there is no competition with wild type 

3 3 bacteriophage coat proteins for packaging. 
34 

3 5 Desirably the target peptide is displayed in a location 

3 6 exposed to the external environment of the PDCP, after 
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1 the PDCP particle has been released from the host cell 

2 without recourse to cell lysis. The target peptide is 

3 then accessible for binding to its ligand. Thus, the 

4 target peptide may be located at or near the N-terminus 

5 or the C-terminus of a nucleotide binding domain, for 

6 example the DNA binding domain of the oestrogen 

7 receptor. 
8 

9 The present invention also provides a method for 

10 screening a DNA library expressing one or more 

11 polypeptide chains that are processed, folded and 

12 assembled in the periplasmic space to achieve 

13 biological activity. The PDCP may be assembled by the 

14 following steps: 
15 

16 (a) Construction of N- or C-terminal DBD chimeric 

17 protein fusions in a phagemid vector. 

18 (i) When the target peptide is located at the N- 

19 terminus of the nucleotide binding portion, a library 

20 of DNA sequences each encoding a potential target 

21 peptide is cloned into an appropriate location of an 

22 expression vector (i.e. behind an appropriate promoter 

2 3 and translation sequences and a sequence encoding a 

24 signal peptide leader directing transport of the 

25 downstream fusion protein to the periplasmic space) and 

26 upstream of the sequence encoding the nucleotide 

27 binding portion. In a preferred embodiment the DNA 

28 sequence (s) of interest may be joined, by a region of 

29 DNA encoding a flexible amino acid linker, to the 5'- 

30 end of an oestrogen receptor DBD. 

31 (ii) Alternatively, when the target peptide is 

32 located at the C-terminus of the nucleotide binding 

33 domain, a library of DNA sequences each encoding a 

34 potential target peptide is cloned into the expression 

3 5 vector so that the nucleotide sequence coding for the 
3 6 nucleotide binding portion is upstream of the cloned 
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1 DNA target peptide encoding sequences, said nucleotide 

2 binding portion being positioned behind an appropriate 

3 promoter and translation sequences and a sequence 

4 encoding a signal peptide leader directing transport of 

5 the downstream fusion protein to the periplasmic space. 

6 In a preferred embodiment, DNA sequence (s) of interest 

7 may be joined, by a region of DNA encoding a flexible 

8 amino acid linker oestrogen receptor DBD DNA sequence. 



10 



Located on the expression vector is the specific HRE 

11 nucleotide sequence recognised, and bound, by the 

12 oestrogen receptor DBD. In order to vary the number of 

13 chimeric proteins displayed on each PDCP particle, this 

14 sequence can be present as one or more copies in the 

15 vector. 
16 

17 (b) Incorporation into the PDCP. Non-lytic helper 

18 bacteriophage infects host cells containing the 

19 expression vector. Preferred types of bacteriophage 

20 include the filamentous phage fd, fl and M13 . In a 

21 more preferred embodiment the bacteriophage may be 

22 M13K07. 



2 4 The protein (s) of interest are expressed and 

2 5 transported to the periplasmic space, and the properly 

2 6 assembled proteins are incorporated into the PDCP 

27 particle by virtue of the high affinity interaction of 

2 8 the DBD with the specific target nucleotide sequence 

2 9 present on the phagemid vector DNA which is naturally 

3 0 packaged into phage particles in a single-stranded 
31 form. The high affinity interaction between the DBD 
3 2 protein and its specific target nucleotide sequence 
3 3 prevents displacement by bacteriophage coat proteins 
34 resulting in the incorporation of the protein (s) of 

3 5 interest onto the surface of the PDCP as it is extruded 

36 from the cell. 
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1 (c) selection of the peptide of interest. Particles 

2 which display the peptide of interest are then selected 

3 from the culture by affinity enrichment techniques. 

4 This is accomplished by means of a ligand specific for 

5 the protein of interest, such as an antigen if the 

6 protein of interest is an antibody. The ligand may be 

7 presented on a solid surface such as the surface of an 
a ELISA plate, or in solution. Repeating the affinity 

9 selection procedure provides an enrichment of clones 

10 encoding the desired sequences, which may then be 

11 isolated for sequencing, further cloning and/or 

12 expression. 
13 

14 Numerous types of libraries of peptides fused to the 

15 DBD can be screened under this embodiment including: 
16 

17 (i) Random peptide sequences encoded by synthetic 

18 DNA of variable length. 
19 

20 (ii) Single-chain Fv antibody fragments. These 

21 consist of the antibody heavy and light chain 

22 variable region domains joined by a flexible 

23 linker peptide to create a single-chain antigen 

24 binding molecule. 
25 

26 (iii) Random fragments of naturally occurring 

27 proteins isolated from a cell population 

28 containing an activity of interest. 
29 

3 0 In another embodiment the invention concerns methods 

31 for screening a DNA library whose members require more 

32 than one chain for activity, as required by, for 

3 3 example, antibody Fab fragments for ligand binding. In 

34 this embodiment heavy or light chain antibody DNA is 

35 joined to a nucleotide sequence encoding a DNA binding 

36 domain of, for example, the oestrogen receptor in a 
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1 phagemid vector. Typically the antibody DNA library 

2 sequences for either the heavy (VH and CHI) or light 

3 chain (VL and CL) genes are inserted in the 5' region 

4 of the oestrogen receptor DBD DNA, behind an 

5 appropriate promoter and translation sequences and a 

6 sequence encoding a signal peptide leader directing 

7 transport of the downstream fusion protein to the 

8 periplasmic space. 
9 

XO Thus, a DBD fused to a DNA library member-encoded 

11 protein is produced and assembled in to the viral 

12 particle after infection with bacteriophage. The second 

13 and any subsequent chain (s) are expressed separately 

14 either: 
15 

16 (a) from the same phagemid vector containing the DBD 

17 and the first polypeptide fusion protein, 

18 or 
19 

20 (b) from a separate region of DNA which may be present 

21 in the host cell nucleus, or on a plasmid, phagemid or 

22 bacteriophage expression vector that can co-exist, in 

23 the same host cell, with the first expression vector, 

24 so as to be transported to the periplasm where they 

25 assemble with the first chain that is fused to the DBD 

2 6 protein as it exits the cell. Peptide display carrier 

27 packages (PDCP) which encode the protein of interest 

28 can then be selected by means of a ligand specific for 

29 the protein. 
30 

31 In yet another embodiment, the invention concerns 

32 screening libraries of bi-f unctional peptide display 

3 3 carrier packages where two or more activities of 
34 interest are displayed on each PDCP. In this 

3 5 embodiment, a first DNA library sequence (s) is inserted 

36 next to a first DNA binding domain (DBD) DNA sequence, 



SUBSTITUTE SHEET (RULE 26) 



WO 99/11785 PCT/GB98/02630 

24 

1 for example the oestrogen receptor DBD, in an 

2 appropriate vector, behind an appropriate promoter and 

3 translation sequences and a sequence encoding a signal 

4 peptide leader directing transport of this first 

5 chimeric protein to the periplasmic space. A second 

6 chimeric protein is also produced from the same, or 

7 separate, vector by inserting a second DNA library 

8 sequence (s) next to a second DBD DNA sequence which is 

9 different from the first DBD DNA sequence, for example 

10 the progesterone receptor DBD, behind an appropriate 

11 promoter and translation sequences and a sequence 

12 encoding a signal peptide leader. The first, or only, 

13 vector contains the specific HRE nucleotide sequences 

14 for both oestrogen and progesterone receptors. 

15 Expression of the two chimeric proteins, results in a 

16 PDCP with two different chimeric proteins displayed. As 

17 an example, one chimeric protein could possess a 

18 binding activity for a particular ligand of interest, 

19 while the second chimeric protein could possess an 

2 0 enzymatic activity. Binding by the PDCP to the ligand 

21 of the first chimeric protein could then be detected by 

22 subsequent incubation with an appropriate substrate for 

23 the second chimeric protein. In an alternative 

24 embodiment a bi-f unctional PDCP may be created using a 

25 single DBD, by cloning one peptide at the 5 1 -end of the 

26 DBD, and a second peptide at the 3 ' -end of the DBD. 

27 Expression of this single bi-f unctional chimeric 

28 protein results in a PDCP with two different 

29 activities. 
30 

31 We have investigated the possibility of screening 

32 libraries of peptides, fused to a DNA binding domain 

33 and displayed on the surface of a display package, for 

34 particular peptides with a biological activity of 

3 5 interest and recovering the DNA encoding that activity. 
3 6 Surprisingly, by manipulating the oestrogen receptor 
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1 DNA binding domain in conjunction with M13 

2 bacteriophage we have been able to construct novel 

3 particles which display large biologically functional 

4 molecules, that allows enrichment of particles with the 

5 desired specificity. 
6 

7 The invention described herein provides a significant 

8 breakthrough in DNA library screening technology. 

9 

10 The invention will now be further described by 

11 reference to the non-limiting examples and figures 

12 below. 

13 

14 Description of Figures 

15 

16 Figure 1 shows the pDM12 N-terminal fusion oestrogen 

17 receptor DNA binding domain expression vector 

18 nucleotide sequence (SEQ ID No 3) , between the Hindlll 

19 and EcoRI restriction sites, comprising a pelB leader 

20 secretion sequence (in italics) , multiple cloning site 

21 containing Sfil and NotI sites, flexible (glycine) 4 - 

22 serine linker sequence (boxed) , a fragment of the 

23 oestrogen receptor gene comprising amino acids 176-282 

24 ( SEQ ID No 4) of the full length molecule, and the 38 

25 base pair consensus oestrogen receptor DNA binding 

2 6 domain HRE sequence. 
27 

28 Figure 2 shows the OD 4SOmn ELISA data for negative 

29 control M13K07 phage, and single-clone PDCP display 

3 0 culture supernatants (#1-4, see Example 3) isolated by 

31 selection of the lymphocyte cDNA-pDM12 library against 

32 anti-human immunoglobulin kappa antibody. 
33 

34 Figure 3 shows partial DNA (SEQ ID No 5) and amino acid 

3 5 (SEQ ID No 6) sequence for the human immunoglobulin 

36 kappa constant region (Rabat, E. A. et al., Sequences 
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1 of Proteins of Immunological Interest. 4 th edition. U.S. 

2 Department of Health and Human Services. 1987) , and 

3 ELISA positive clones #2 (SEQ ID Nos 7 and 8) and #3 

4 (SEQ ID Nos 9 and 10) from Figure 2 which confirms the 

5 presence of human kappa constant region DNA in-frame 

6 with the pelB leader sequence (pelB leader sequence is 

7 underlined, the leader sequence cleavage site is 

8 indicated by an arrow). The differences in the 5 '-end 

9 sequence demonstrates that these two clones were 

10 selected independently from the library stock. The PCR 

11 primer sequence is indicated in bold, clone #2 was 

12 originally amplified with CDNAPCRBAK1 and clone #3 was 

13 amplified with CDNAPCRBAK2 . 
14 

15 Figure 4 shows the pDM14 N-terminal fusion oestrogen 

16 receptor DNA binding domain expression vector 

17 nucleotide sequence (SEQ ID No 11) , between the Hindlll 

18 and EcoRI restriction sites, comprising a pelB leader 

19 secretion sequence (in italics) , multiple cloning site 
2 0 containing Sfil and NotI sites, flexible (glycine) 4 - 

21 serine linker sequence (boxed) , a fragment of the 

22 oestrogen receptor gene comprising amino acids 176-282 
2 3 (SEQ ID No 12) of the full length molecule, and the two 
2 4 3 8 base pair oestrogen receptor DNA binding domain HRE 
2 5 sequences (HRE 1 and HRE 2) . 

26 

27 Figure 5 shows the pDM16 C-terminal fusion oestrogen 

2 8 receptor DNA binding domain expression vector 

29 nucleotide sequence (SEQ ID No 13), between the Hindlll 

3 0 and EcoRI restriction sites, comprising a pelB leader 
31 secretion sequence (in italics) , a fragment of the 

3 2 oestrogen receptor gene comprising amino acids 176-282 

3 3 (SEQ ID No 14) of the full length molecule, flexible 

34 (glycine) 4-serine linker sequence (boxed) , multiple 

3 5 cloning site containing Sfil and NotI sites and the 38 

3 6 base pair oestrogen receptor DNA binding domain HRE 
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1 sequence. 
2 

3 Figure 6 shows the OD 450mil ELISA data for N-cadherin- 

4 pDM16 C-terminal display PDCP binding to anti-pan- 

5 cadherin monoclonal antibody in serial dilution ELISA 

6 as ampicillin resitance units (a.r.u.). Background 

7 binding of negative control M13K07 helper phage is also 

8 shown . 
9 

10 Figure 7 shows the OD 450nm ELISA data for in vivo 

11 biotinylated PCC-pDM16 C-terminal display PDCP binding 

12 to streptavidin in serial dilution ELISA as ampicillin 

13 resitance units (a.r.u.)- Background binding of 

14 negative control M13K07 helper phage is also shown. 
15 

16 Figure 8 shows the OD 450mn ELISA data for a human scFv 

17 PDCP isolated from a human scFv PDCP display library 

18 selected against substance P. The PDCP was tested 

19 against streptavidin (1) , streptavidin-biotinylated 

20 substance P (2) , and streptavidin-biotinylated CGRP 

21 (3), in the presence (B) or absence (A) of free 

22 substance P. 
23 

24 Figure 9 shows the DNA (SEQ ID Nos 15 and 17) and amino 

25 acid (SEQ ID No 16 and 18) sequence of the substance P 

26 binding scFv isolated from a human scFv PDCP display 

27 library selected against substance P. Heavy chain (SEQ 

28 ID Nos 15 and 16) and light chain (SEQ ID Nos 17 and 

29 18) variable region sequence is shown with the CDRs 

30 underlined and highlighted in bold. 
31 

32 Materials and Methods 

33 The following procedures used by the present applicant 

34 are described in Sambrook, J., et al. , 1989 supra.: 

3 5 restriction enzyme digestion, ligation, preparation of 

36 electrocompetent cells, electroporation , analysis of 
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1 restriction enzyme digestion products on agarose gels, 

2 DNA purification using phenol/chloroform, preparation 

3 of 2xTY medium and plates, preparation of ampicillin, 

4 kanamycin, IPTG (Isopropyl /3-D-Thiogalactopyranoside) 

5 stock solutions, and preparation of phosphate buffered 

6 saline. 
7 

8 Restriction enzymes, T4 DNA ligase and cDNA synthesis 

9 reagents (Superscript plasmid cDNA synthesis kit) were 

10 purchased from Life Technologies Ltd (Paisley, 

11 Scotland, U.K.)- Oligonucleotides were obtained from 

12 Cruachem Ltd (Glasgow, Scotland, U.K.), or Genosys 

13 Biotechnologies Ltd (Cambridge, U.K.). Taq DNA 

14 polymerase, Wizard SV plasmid DNA isolation kits, 

15 streptavidin coated magnetic beads and mRNA isolation 

16 reagents (PolyATract 1000) were obtained from Promega 

17 Ltd (Southampton, Hampshire, U.K.). Taqplus DNA 

18 polymerase was obtained from Stratagene Ltd (Cambridge, 

19 U.K.). PBS, BSA, streptavidin, substance P and anti-pan 

20 cadherin antibody were obtained from SIGMA Ltd (Poole, 

21 Dorset, U.K.). Anti-M13 -HRP conjugated antibody, 

22 Kanamycin resistant M13K07 helper bacteriophage and 

23 RNAguard were obtained from Pharmacia Ltd (St. Albans, 

24 Herts, U.K.) and anti-human Ig<c antibody from Harlan- 

25 Seralab (Loughborough, Leicestershire, U.K.) 

26 Biotinylated substance P and biotinylated calcitonin 

27 gene related peptide (CGRP) were obtained from 

28 Peninsula Laboratories (St. Helens, Merseyside, U.K.). 
29 

30 Specific embodiments of the invention are given below 

31 in Examples l to 9 . 
32 
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1 Example l. Construction of a N-terminal PDCP display 

2 phagemid vector pDM12. 

3 

4 The pDM12 vector was constructed by inserting an 

5 oestrogen receptor DNA binding domain, modified by 

6 appropriate PCR primers, into a phagemid vector pDM6. 

7 The pDM6 vector is based on the pUC119 derived phage 

8 display vector pHENl (Hoogenboom et al. , 1991, Nucleic 

9 Acids Res. 19: 4133-4137). It contains (Gly) 4 Ser linker, 

10 Factor Xa cleavage site, a full length gene III, and 

11 streptavidin tag peptide sequence (Schmidt, T.G. and 

12 Skerra, A., 1993, Protein Engineering 6: 109-122), all 

13 of which can be removed by Notl-EcoRI digestion and 
3.4 agarose gel electrophoresis, leaving a pelB leader 

15 seguence, Sfil, Ncol and PstI restriction sites 

16 upstream of the digested NotI site. The cloned DNA 

17 binding domain is under the control of the lac promoter 

18 found in pUC119. 
19 

2 0 Preparation of pDM6 

21 

22 The pDM12 vector was constructed by inserting an 

2 3 oestrogen receptor DNA binding domain, modified by 

24 appropriate PCR primers, into a phagemid vector pDM6. 

25 The pDM6 vector is based on the gene pill phage display 

26 vector pHENl (Hoogenboom et al., 1991, Nucleic Acids 

27 Res. 19: 4133-4137), itself derived from pUC119 (Viera, 

28 J. and Messing, J., 1987, Methods in Enzymol. 153: 

29 3-11) . It was constructed by amplifying the pill gene 

30 in pHENl with two oligonucleotides: 
31 

3 2 PDM6BAK: 5 -TTT TCT GCA GTA ATA GGC GGC CGC AGG GGG AGG 
3 3 AGG GTC CAT CGA AGG TCG CGA AGC AGA GAC TGT TGA AAG T-3 
34 (SEQ ID No 19) and 

35 

3 6 PDM6FOR: 5 - TTT TGA ATT CTT ATT AAC CAC CGA ACT GCG 



SUBSTITUTE SHEET (RULE 26) 



WO 99/11785 



PCT/GB98/02630 



1 GGT GAC GCC AAG CGC TTG CGG CCG TTA AGA CTC CTT ATT ACG 

2 CAG-3 <SEQ ID No 20). 
3 

4 and cloning the Pstl-EcoRI digested PCR product back 

5 into similarly digested pHENl, thereby removing the 

6 c-myc tag sequence and supE TAG codon from pHENl. The 

7 pDM6 vector contains a (Gly) 4 Ser linker, Factor Xa 

8 cleavage site, a full length gene III, and streptavidin 

9 tag peptide sequence (Schmidt, T.G. and Skerra, A. , 

10 1993, Protein Engineering 6: 109-122}, all of which can 

11 be removed by Notl-EcoRI digestion and agarose gel 

12 electrophoresis, leaving a pelB leader sequence, Sfil, 

13 Ncol and PstI restriction sites upstream of the 

14 digested NotI site. The cloned DNA binding domain is 

15 under the control of the lac promoter found in pUC119. 
16 

17 The oestrogen receptor DNA binding domain was isolated 

18 from cDNA prepared from human bone marrow (Clontech, 

19 Palo Alto, California, U.S.A.) . cDNA can be prepared by 

2 0 many procedures well known to those skilled in the art. 

21 As an example, the following method using a Superscript 

22 plasmid cDNA synthesis kit can be used: 
23 

24 (a) First strand synthesis. 

25 

26 5/j.g of bone marrow mRNA, in 5/xl DEPC-treated water was 

27 thawed on ice and 2/u.l (5 0pmol) of cDNA synthesis primer 

28 (5 ' -AAAAGCGGCCGCACTGGCCTGAGAGA (N) 6 -3 1 ) (SEQ ID No 21) 

29 was added to the mRNA and the mixture heated to 7 0 °C 

3 0 for 10 minutes, then snap-chilled on ice and spun 

31 briefly to collect the contents to the bottom of the 

32 tube. The following were then added to the tube: 

3 3 lOOOu/ml RNAguard 1^1 

34 5x first strand buffer 4/xl 

3 5 0. 1M DTT 2/il 

3 6 lOmM dNTPs ljul 
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1 2 00u/>l Superscript II reverse transcriptase 5/il 

2 The mixture was mixed by pipetting gently and incubated 

3 at 37 °C for 1 hour, then placed on ice. 
4 

5 (b) Second strand synthesis. 

6 

7 The following reagents were added to the first strand 



8 reaction: 

9 DEPC-treated water 93^1 

10 5x second strand buffer 3 0/xl 

11 lOmM dNTPs 3/il 

12 I0u//xl E. coli DNA ligase 1/xl 

13 10u//xl E, coli DNA polymerase 4/xl 

14 2u//xl E. coli RNase H Ipl 



15 The reaction was vortex mixed and incubated at 16 °C for 

16 2 hours. 2 ju.1 (10u) of T4 DNA polymerase was added and 

17 incubation continued at 16 °C for 5 minutes. The 

18 reaction was placed on ice and 10/zl 0.5M EDTA added, 

19 then phenol-chloroform extracted, precipitated and 

20 vacuum dried. 
21 

22 (c) Sal I adaptor ligation. 

23 

24 The cDNA pellet was resuspended in 25/il DEPC-treated 

2 5 water, and ligation set up as follows. 



26 cDNA 25/x.l 

27 5x T4 DNA ligase buffer lOpl 
2 8 lfx.g/(xl Sal I adapters* 10/x.l 
29 lu/jttlT4 DNA ligase 5(il 



30 *Sal I adapters: TCGACCCACGCGTCCG-3 ' {SEQ ID No 22) 

31 GGGTGCCGAGGC-5 * (SEQ ID No 23) 

32 The ligation was mixed gently and incubated for 16 
3 3 hours at 16 °C, then phenol-chloroform extracted, 

3 4 precipitated and vacuum dried. The cDNA/adaptor pellet 

3 5 was resuspended in 41^1 of DEPC-treated water and 

3 6 digested with 60 units of NotI at 37 °C for 2 hours, 
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1 then phenol-chloroform extracted, precipitated and 

2 vacuum dried. The cDNA pellet was re-dissolved in 100/lcI 

3 TEN buffer (lOmM Tris pH 7.5, 0 . ImM EDTA, 2 5mM NaCl) 

4 and size fractionated using a Sephacryl S-500 HR column 

5 to remove unligated adapters and small cDNA fragments 

6 (<400bp) according to the manufacturers instructions. 

7 Fractions were checked by agarose gel electrophoresis 

8 and fractions containing cDNA less than 400 base pairs 

9 discarded, while the remaining fractions were pooled - 
L0 

LI (d) PCR amplification of oestrogen receptor DNA binding 

L2 domain. 

L3 

L4 The oestrogen receptor was PCR amplified from 5jul (ISO- 
LB 250ng) of bone marrow cDNA using 25pmol of each of the 
16 primers pDM12FOR (SEQ ID No 24) (5'- 

L7 AAAAGAATTCTGAATGTGTTATTTTAGCTCAGGTCACTCTGACCTGATTATCAAG 

L8 ACCCCACTTCACCCCCT) and pDM12BAK (SEQ ID No 25) (5'- 

L9 AAAAGCGGCCGCAGGGGGAGGAGGGTCCATGGAATCTGCCAAGGAG— 3 ' ) in 

20 two 50/*l reactions containing O.lmM dNTPs , 2.5 units 

21 Taq DNA polymerase, and lx PCR reaction buffer (lOmM 

22 Tris-HCl pH 9.0, 5mM KCl, 0.01% Triton X"-100, 1 . 5mM 

23 MgCl 2 ) (Promega Ltd, Southampton, U.K.). The pDM12F0R 

24 primer anneals to the 3 ' -end of the DNA binding domain 
2 5 of the oestrogen receptor and incorporates two stop 

2 6 codons, the 3 8 base pair consensus oestrogen receptor 

2 7 HRE sequence, and an EcoRI restriction site. The 

28 pDM12BAK primer anneals to the 5 ' -end of the DNA 

2 9 binding domain of the oestrogen receptor and 

3 0 incorporates the (Gly) 4 Ser linker and the NotI 
31 restriction site. 

32 

3 3 Reactions were overlaid with mineral oil and PCR 

3 4 carried out on a Techne PHC-3 thermal cycler for 3 0 

35 cycles of 94°C, 1 minute; 65°C, 1 minute; 72°C, 1 

3 6 minute. Reaction products were electrophoresed on an 
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1 agarose gel, excised and products purified from the gel 

2 using a Geneclean II kit according to the manufacturers 

3 instructions (BiolOl, La Jolla, California, U.S.A.) . 
4 

5 (e) Restriction digestion and ligation. 

6 

7 The PCR reaction appended NotI and EcoRI restriction 

8 sites, the (Gly) 4 Ser linker, stop codons and the 38 base 

9 pair oestrogen receptor target HRE nucleotide sequence 
10 to the oestrogen receptor DNA binding domain sequence 
xl ( S ee Figure 1) . The DNA PCR fragment and the target 

12 pDM6 vector (approximately 500ng) were NotI and EcoRI 

13 digested for 1 hour at 37 °C, and DNA purified by 

14 agarose gel electrophoresis and extraction with 

15 Geneclean II kit (BiolOl, La Jolla, California, 

16 U.S.A.). The oestrogen receptor DNA binding domain 

17 cassette was ligated into the Notl-EcoRI digested P DM6 

18 vector overnight at 16°C, phenol/ chloroform extracted 

19 and precipitated then electroporated into TGI E. coli 

20 (genotype: K12 , (Alac-pro) , supE, thi, hsD5/F ' traD3 6 , 

21 proA + B + , Lacl q , LacZAl5) and plated onto 2xTY agar 

22 plates supplemented with 1% glucose and lOO^g/ml 

23 ampicillin. Colonies were allowed to grow overnight at 

24 37 °C. Individual colonies were picked into 5ml 2xTY 

25 supplemented with 1% glucose and lOO^g/ml ampicillin 

26 and grown overnight at 37 °C. Double stranded phagemid 

27 DNA was isolated with a Wizard SV plasmid DNA isolation 

28 kit and the sequence confirmed with a Prism dyedeoxy 

29 cycle sequencing kit (Perkin-Elmer , Warrington, 

30 Lancashire, U.K.) using M13FOR (SEQ ID No 26) (5'- 

31 GTAAAACGACGGCCAGT) and Ml 3 REV (SEQ ID No 27) (5'- 

32 GGATAACAATTTCACACAGG) oligonucleotides. The pDM12 PDCP 
3 3 display vector DNA sequence between the Hindlll and 

34 EcoRI restriction sites is shown in Figure 1. 



35 



3 6 Example 2, Insertion of a random-primed human 
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1 lymphocyte cDNA into pDM12 and preparation of a master 

2 PDCP stock. 

3 

4 Libraries of peptides can be constructed by many 

5 methods known to those skilled in the art. The example 

6 given describes a method for constructing a peptide 

7 library from randomly primed cDNA, prepared from mRNA 

8 isolated from a partially purified cell population. 
9 

10 mRNA was isolated from approximately 10 9 human 

11 peripheral blood lymphocytes using a polyATract 1000 

12 mRNA isolation kit (Promega, Southampton, UK) . The cell 

13 pellet was resuspended in 4ml extraction buffer (4M 

14 guanidine thiocyanate, 25mM sodium citrate pH 7.1, 2% 

15 /3-mercaptoethanol ) . 8ml of pre-heated (70 °C) dilution 

16 buffer (6XSSC, lOmM Tris pH 7.4, ImM EDTA, 0.2 5% SDS, 

17 1% jS-mercaptoethanol) was added to the homogenate and 

18 mixed thoroughly by inversion. 10^1 of biotinylated 

19 oligo-dT (50 pmol/>l) was added, mixed and the mixture 
2 0 incubated at 70 °C for 5 minutes. The lymphocyte cell 

21 lysate was transferred to 6x 2ml sterile tubes and spun 

22 at 13,000 rpm in a microcentrifuge for ten minutes at 

2 3 ambient temperature to produce a cleared lysate. During 

2 4 this centrif ugation, streptavidin coated magnetic beads 

-2 5 were resuspended and 6ml transferred to a sterile 50ml 

2 6 Falcon tube, then placed in the magnetic stand in a 

27 horizontal position until all the beads were captured. 

28 The supernatant was carefully poured off and beads 

2 9 resuspended in 6ml 0.5xSSC, then the capture repeated. 

3 0 This wash was repeated 3 times, and beads resuspended 
31 in a final volume of 6ml 0.5XSSC. The cleared lysate 
3 2 was added to the washed beads, mixed by inversion and 
3 3 incubated at ambient temperature for 2 minutes, then 
34 beads captured in the magnetic stand in a horizontal 
3 5 position. The beads were resuspended gently in 2ml 

3 6 0.5xSSC and transferred to a sterile 2ml screwtop tube, 
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1 then captured again in the vertical position, and the 

2 wash solution discarded. This wash was repeated twice 

3 more. 1ml of DEPC-treated water was added to the beads 

4 and mixed gently. The beads were again captured and the 

5 eluted mRNA transferred to a sterile tube. 50^1 was 

6 electrophoresed to check the quality and quantity of 

7 mRNA, while the remainder was precipitated with 0.1 

8 volumes 3M sodium acetate and three volumes absolute 

9 ethanol at -80 °C overnight in 4 aliquots in sterile 
10 1.5ml screwtop tubes. 

11 

12 Double stranded cDNA was synthesised as described in 

13 Example 1 using 5/ig of lymphocyte mRNA as template. 

14 The cDNA was PCR amplified using oligonucleotides 

15 CDNAPCRFOR (SEQ ID No 28) (5'- 

16 AAAGCGGCCGCACTGGCCTGAGAGA) , which anneals to the cDNA 

17 synthesis oligonucleotide described in Example 1 which 

18 is present at the 3 " -end of all synthesised cDNA 

19 molecules incorporates a NotI restriction site, and an 

20 equimolar mixture of CDNAPCRBAK1, CDNAPCRBAK2 and 

2 1 CDNAPCRBAK3 . 

22 CDNAPCRBAK1: (SEQ ID No 29) 5'- 

2 3 AAAAGGCCCAGCCGGCCATGGCCCAGCCCACCACGCGTCCG , 

2 4 CDNAPCRBAK2: (SEQ ID No 30) 5'- 

2 5 AAAAGGCCCAGCCGGCCATGGCCCAGTCCCACCACGCGTCCG , 

26 CDNAPCRBAK3: (SEQ ID No 31) 5'- 

27 AAAAGGCCCAGCCGGCCATGGCCCAGTACCCACCACGCGTCCG) , 

2 8 all three of which anneal to the Sail adaptor sequence 
29 found at the 5 ' -end of the cDNA and incorporate a Sfil 

3 0 restriction site at the cDNA 5' -end. Ten PCR reactions 

31 were carried out using 2m1 of cDNA (50ng) per reaction 

32 as described in Example 1 using 25 cycles of 94 °C, l 

3 3 minute; 60 °C, 1 minute; 72 °C, 2 minutes. The reactions 

34 were pooled and a 2 0jul aliquot checked by agarose gel 

35 electrophoresis, the remainder was phenol/ chloroform 
3 6 extracted and ethanol precipitated and resuspended in 
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1 100/Ltl sterile water. 5^tg of pDM12 vector DNA and 

2 lymphocyte cDNA PCR product were Sfil-NotI digested 

3 phenol /chloroform extracted and small DNA fragments 

4 removed by size selection on Chromaspin 1000 spin 

5 columns (Clontech, Palo Alto, California, U.S.A.) by 

6 centrifugation at 700g for 2 minutes at room 

7 temperature. Digested pDM12 and lymphocyte cDNA were 

8 ethanol precipitated and ligated together for 16 hours 

9 at 16 °C. The ligated DNA was precipitated and 

10 electroporated in to TGI E. coll. Cells were grown in 

11 1ml SOC medium per cuvette used for 1 hour at 37°C, and 

12 plated onto 2xTY agar plates supplemented with 1% 

13 glucose and lOOMg/ml ampicillin. 10 -4 , 10" 5 and 10" 6 

14 dilutions of the electroporated bacteria were also 

15 plated to assess library size. Colonies were allowed to 

16 grow overnight at 30 °C. 2x10 s ampicillin resistant 

17 colonies were recovered on the agar plates. 

18 The bacteria were then scraped off the plates into 40ml 

19 2xTY broth supplemented with 20% glycerol, 1% glucose 

20 and lOO/zg/ml ampicillin. 5ml was added to a 20ml 2xTY 

21 culture broth supplemented with 1% glucose and 100/ig/ml 

22 ampicillin and infected with 10" kanamycin resistance 

23 units (kru) M13K07 helper phage at 37°C for 30 minutes 

24 without shaking, then for 30 minutes with shaking at 

25 200rpm. Infected bacteria were transferred to 200ml 

26 2xTY broth supplemented with 25/xg/ml kanamycin, 

27 I00^g/ml ampicillin, and 20juM IPTG, then incubated 

28 overnight at 37 °C, shaking at 200rpm. Bacteria were 

29 pelleted at 4 000rpm for 20 minutes in 50ml Falcon 

3 0 tubes, and 4 0ml 2 . 5M NaCl/2 0% PEG 6000 was added to 

31 200ml of particle supernatant, mixed vigorously and 

32 incubated on ice for 1 hour to precipitate PDCP 

33 particles. Particles were pelleted at llOOOrpm for 30 

34 minutes in 250ml Oakridge tubes at 4°C in a Sorvall 

35 RC5B centrifuge, then resuspended in 2ml PBS buffer 

3 6 after removing all traces of PEG/NaCl with a pipette, 
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1 then bacterial debris removed by a 5 minute 13 500rpm 

2 spin in a microcentrifuge. The supernatent was filtered 

3 through a 0.45^tm polysulfone syringe filter and stored 

4 at -20 C C. 

5 

6 Example 3 . Isolation of human immunoglobulin kappa 

7 light chains by repeated rounds of selection against 

8 ant i -human kappa antibody. 

9 

10 For the first round of library selection a 70xllnun NUNC 

11 Maxisorp Immunotube (Life Technologies, Paisley, 

12 Scotland U.K.) was coated with 2.5ml of 10/xg/ml of 

13 anti-human kappa antibody (Seralab, Crawley Down, 

14 Sussex, U.K.) in PBS for 2 hours at 37°C. The tube was 

15 rinsed three times with PBS (fill & empty) and blocked 

16 with 3ml PBS/2% BSA for 2 hours at 37 °C and washed as 

17 before. 4xl0 12 a.r.u. of pDM12- lymphocyte cDNA PDCP 

18 stock was added in 2ml 2% BSA/PBS/0 . 05% Tween 20, and 

19 incubated for 30 minutes on a blood mixer, then for 90 
2 0 minutes standing at ambient temperature. The tube was 

21 washed ten times with PBS/ 0.1% Tween 20, then a further 

22 ten times with PBS only. Bound particles were eluted in 

23 1ml of freshly prepared 0 . 1M triethylamine for 10 

2 4 minutes at ambient temperature on a blood mixer. Eluted 

25 particles were transferred to 0.5ml 1M Tris pH 7.4, 

2 6 vortex mixed briefly and transferred to ice. 
27 

28 Neutralised particles were added to 10ml log phase TGI 

29 E coli bacteria (optical density: OD 600nm 0.3-0.5) and 

30 incubated at 37 °C without shaking for 3 0 minutes, then 

31 with shaking at 200rpm for 30 minutes. 10 -3 , 1C 4 & 10" 5 

3 2 dilutions of the infected culture were prepared to 
3 3 estimate the number of particles recovered, and the 

3 4 remainder was spun at 4 000 rpm for 10 minutes, and the 

3 5 pellet resuspended in 3 00^1 2xTY medium by vortex 

3 6 mixing. Bacteria were plated onto 2xTY agar plates 
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1 supplemented with 1% glucose and 100/xg/ml ampicillin. 

2 Colonies were allowed to grow overnight at 3 0°C. 

3 

4 A PDCP stock was prepared from the bacteria recovered 

5 from the first round of selection, as described in 

6 Example 2 from a 100ml overnight culture. 2 50jul of the 

7 round 1 amplified PDCP stock was then selected against 

8 anti-human kappa antibody as described above with the 

9 tube was washed twelve times with PBS/0-1% Tween 20, 
10 then a further twelve times with PBS only. 

11 

12 To identify selected clones, eighty-eight individual 

13 clones recovered from the second round of selection 

14 were then tested by ELISA for binding to anti-human 

15 kappa antibody. Individual colonies were picked into 

16 100/il 2xTY supplemented with lOO/ng/ml ampicillin and 1% 

17 glucose in 96-well plates (Costar) and incubated at 

18 37 °C and shaken at 200rpm for 4 hours. 25^1 of each 

19 culture was transferred to a fresh 96-well plate, 

20 containing 25/*l/well of the same medium plus 10 7 k.r.u. 

21 M13K07 kanamycin resistant helper phage and incubated 
2 2 at 37 °C for 3 0 minutes without shaking, then incubated 
2 3 at 3 7 °C and shaken at 200rpm for a further 3 0 minutes. 

24 160^1 of 2xTY supplemented with I00^tg/ml ampicillin, 

25 25//g/ml kanamycin, and 20ptM IPTG was added to each well 

2 6 and particle amplification continued for 16 hours at 

27 37 °C while shaking at 200rpm. Bacterial cultures were 

28 spun in microtitre plate carriers at 2000g for 10 

29 minutes at 4°C in a benchtop centrifuge to pellet 

30 bacteria and culture supernatant used for ELISA. 
31 

3 2 A Dynatech Immulon 4 ELISA plate was coated with 

33 200ng/well anti-human kappa antibody in lOO^tl /well PBS 

34 for one hour at 37 °C. The plate was washed 2x200/xl/well 

35 PBS and blocked for 1 hour at 37 °C with 200/il/well 2% 

36 BSA/PBS and then washed 2x200/xl/well PBS. 50^1 PDCP 
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1 culture supernatant was added to each well containing 

2 50/xl/well 4% BSA/PBS/O. l%Tween 20, and allowed to bind 

3 for 1 hour at ambient temperature. The plate was washed 

4 three times with 200/il/well PBS/ 0.1% Tween 20, then 

5 three times with 200/il/well PBS. Bound PDCPs were 

6 detected with 100/il/well, 1:5000 diluted anti-M13-HRP 

7 conjugate (Pharmacia) in 2% BSA/PBS/0 . 05% Tween 20 for 

8 1 hour at ambient temperature and the plate washed six 

9 times as above. The plate was developed for 5 minutes 

10 at ambient temperature with lOOjtil/well freshly prepared 

11 TMB (3 , 3 ' , 5, 5 ' -Tetramethylbenzidine) substrate buffer 

12 (0.005% H 2 0 2 , O.lmg/ml TMB in 24mM citric acid/52mM 

13 sodium phosphate buffer pH 5.2). The reaction was 

14 stopped with lOOjul/well 12.5% H 2 S0 4 and read at 450nm. 

15 (ELISA data for binding clones is shown in Figure 2) . 
16 

17 These clones were then sequenced with Ml 3 REV primer 

18 (SEQ ID No 27) as in Example 1. The sequence of two of 

19 the clones isolated is shown in Figure 3 (see SEQ ID 

2 0 Nos 7 to 10) . 
21 

22 Example 4. Construction of the pDM14 N-terminal display 

23 vector 

24 

25 It would be useful to design vectors that contain a 

26 second DBD binding sequence, such as a second oestrogen 

27 receptor HRE sequence, thus allowing the display of 

28 increased numbers of peptides per PDCP. Peale et al. 

29 (1988, Proc. Natl. Acad. Sci. USA 85: 1038-1042) 

3 0 describe a number of oestrogen receptor HRE sequences. 

31 These sequences were used to define an HRE sequence, 

32 which differs from that cloned in pDM12 , which we used 
3 3 to create a second N-terminal display vector (pDM14) . 
34 The oligonucleotide: 5 ' -AAAAGAATTCGAGGTTACATTAACTTTGTT 
3 5 CCGGTCAGACTGACCCAAGTCGACCTGAATGTGTTATTTTAG-3 ' (SEQ ID 
3 6 No 32) was synthesised and used to mutagenise pDM12 by 
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1 PCR with pDM12BAK oligonucleotide as described in 

2 Example 1 using lOOng pDM12 vector DNA as template. The 

3 resulting DNA fragment, which contained the oestrogen 

4 receptor DBD and two HRE sequences separated by a Sail 

5 restriction enzyme site, was Notl-EcoRI restriction 

6 enzyme digested and cloned into Notl-EcoRI digested 

7 pDMl2 vector DNA as described in Example 1 to create 

8 pDM14. The sequence of pDM14 between the Hindlll and 

9 EcoRI restriction enzyme sites was checked by DNA 

10 sequencing. The final vector sequence between these two 

11 sites is shown in Figure 4 (see SEQ ID Nos 11 and 12) . 
12 

13 Example 5. Construction of the pDM16 C-terminal display 

14 vector 

15 

16 In order to demonstrate the display of peptides fused 

17 to the C-terminus of a DBD on a PDCP a suitable vector, 

18 pDM16, was created. 
19 

20 In pDM16 the pelB leader DNA sequence is fused directly 

21 to the oestrogen receptor DBD sequence removing the 

22 multiple cloning sites and the Gly 4 Ser linker DNA 

23 sequence found in pDM12 and pDM14, which are appended 

24 to the C-terminal end of the DBD sequence upstream of 

2 5 the HRE DNA sequence. 
26 

27 To create this vector two separate PCR reactions were 

28 carried out on a Techne Progene thermal cycler for 30 
cycles of 94°C, 1 minute; 60°C, 1 minute; 72°C, 1 
minute. Reaction products were electrophoresed on an 
agarose gel, excised and products purified from the gel 
using a Mermaid or Geneclean II kit, respectively, 

3 3 according to the manufacturers instructions (BiolOl, La 
34 Jolla, California, U.S.A.). 

35 

36 In the first, the 5 ' -untranslated region and pelB 



29 
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1 leader DNA sequence was amplified from lOOng of pDMl2 

2 vector DNA using 50pmol of each of the oligonucleotides 

3 pelBFOR (SEQ ID No 33) (5 ' -CCTTGGCAGATTCCATCT 

4 CGGCCATTGCCGGC-3 ' ) and M13REV (SEQ ID NO 27) (see 

5 above) in a 100/xl reaction containing O.lmM dNTPs, 2.5 

6 units Taqplus DNA polymerase, and lx High Salt PCR 

7 reaction buffer (20mM Tris-HCl pH 9.2, 60mM KC1, 2mM 

8 MgCl 2 ) (Stratagene Ltd, Cambridge, U.K.). 
9 

10 In the second, the 3 ' -end of the pelB leader sequence 

11 and the oestrogen receptor DBD was amplified from lOOng 

12 of pDM12 vector DNA using SOpmol of each of the 

13 oligonucleotides pelBBAK (SEQ ID No 34) (5'-CCGGCAA 

14 TGGCCGAGATGGAATCTGCCAAGG-3 ' ) and pDM16FOR (SEQ ID No 
!5 35) ( 5 ' -TTTTGTCGACTCAATCAGTTATGCGGCCGCCAGCTGCAGG 

16 AGGGCCGGCTGGGCCGACCCTCCTCCCCCAGACCCCACTTCACCCC-3 ' ) in a 

17 lOOttl reaction containing O.lmM dNTPs, 2.5 units 

18 Taqplus DNA polymerase, and lx High Salt PCR reaction 

19 buffer (Stratagene Ltd, Cambridge, U.K.). Following gel 

20 purification both products were mixed together and a 

21 final round of PCR amplification carried out to link 

22 the two products together as described above, in a 

23 100/xl reaction containing O.lmM dNTPs, 2.5 units Taq 

24 DNA polymerase, and lx PCR reaction buffer (lOmM Tris- 

25 HC1 pH 9.0, 5mM KC1, 0.01% Triton X"-100, 1 . 5mM MgCl 2 ) 

26 (Pr omega Ltd, Southampton, U.K.). 
27 

28 The resulting DNA fragment, was Hindlll-Sall 

29 restriction enzyme digested and cloned into Hindlll- 

3 0 Sail digested pDM14 vector DNA as described in Example 

31 l to create pDMl6. The sequence of pDMIS between the 

32 Hindlll and EcoRI restriction enzyme sites was checked 
3 3 by DNA sequencing. The final vector sequence between 

34 these two sites is shown in Figure 5 (see SEQ ID Nos 13 

35 and 14) . 
36 



SUBSTITUTE SHEET (RULE 26) 



WO 99/11785 PCT/GB98/02630 

42 

1 Example 6. Display of the C-terminal fragment of human 

2 N-cadherin on the surface of a PDCP 

3 

4 cDNA libraries of peptides can be constructed by many 

5 methods known to those skilled in the art. One commonly 

6 used method for constructing a peptide library uses 

7 oligo dT primed cDNA, prepared from polyA+ mRNA. In 

8 this method the first-strand synthesis is carried out 

9 using an oligonucleotide which anneals to the 3 ' -end 

10 polyA tail of the mRNA composed of T n (where n is 

11 normally between 10 and 20 bases) and a restriction 

12 enzyme site such as NotI to facilitate cloning of cDNA. 

13 The cDNA cloned by this method is normally composed of 

14 the polyA tail, the 3'- end untranslated region and the 

15 c-terminal coding region of the protein. As an example 

16 of the C-terminal display of peptides on a PDCP, a 

17 human cDNA isolated from a library constructed by the 

18 above method was chosen. 



20 



The protein N-cadherin is a cell surface molecule 

21 involved in cell-cell adhesion. The C-terminal 

22 cytoplasmic domain of the human protein (Genbank 

2 3 database accession number: M34064) is recognised by a 

24 commercially available monoclonal antibody which was 

25 raised against the C-terminal 23 amino acids of chicken 

26 N-cadherin (SIGMA catalogue number: C-1821) . The 1 . 4kb 

27 human cDNA fragment encoding the C-terminal 99 amino 

28 acids, 3'- untranslated region and polyA tail (NotI 

29 site present at the 3 ' -end of the polyA tail) was 

30 amplified from approximately 20ng pDM7-NCAD#c with 

31 25pmol of each oligonucleotide M13FOR (SEQ ID No 26) 

32 and CDNPCRBAK1 (SEQ ID No 29) (see above) in a 50/il 

3 3 reaction containing 0 . ImM dNTPs, 2.5 units Taqplus DNA 

34 polymerase, and lx High Salt PCR reaction buffer (20mM 

35 Tris-HCl pH 9.2, 60mM KC1, 2mM MgCl 2 ) (Stratagene Ltd, 

36 Cambridge, U.K.) on a Techne Progene thermal cycler for 
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1 30 cycles of 94°C, 1 minute; 60°C, 1 minute; 72°C, 1 

2 minute. Following gel purification and digestion with 

3 Sfil and NotI restiction enzymes, the PCR product was 

4 cloned into pDM16 using an analogous protocol as 

5 described in Example 1. 
6 

7 Clones containing inserts were identified by ELISA of 

8 96 individual PDCP cultures prepared as described in 

9 Example 3 . A Dynatech Immulon 4' ELISA plate was coated 

10 with 1:250 diluted anti-pan cadherin monoclonal 

11 antibody in 100/xl /well PBS overnight at 42C. The plate 

12 was washed 3x200^1^611 PBS and blocked for 1 hour at 

13 37°C with 200/xl/well 2% Marvel non-fat milk powder/PBS 

14 and then washed 2x200/xl/well PBS. 50/aI PDCP culture 

15 supernatant was added to each well containing 50^1/well 

16 4% Marvel/ PBS, and allowed to bind for 1 hour at 

17 ambient temperature. The plate was washed three times 

18 with 200/il/well PBS/0.1% Tween 20, then three times 

19 with 200/xl/well PBS. Bound PDCPs were detected with 

20 100/il/well, 1:5000 diluted anti-M13-HRP conjugate 

21 (Pharmacia) in 2% Marvel /PBS for 1 hour at ambient 

22 temperature and the plate washed six times as above. 

23 The plate was developed for 15 minutes at ambient 

24 temperature with 100/o.l/well freshly prepared TMB 

25 (3,3 ' ,5,5 '-Tetramethylbenzidine) substrate buffer 

26 (0.005% H 2 0 2 , O.lmg/ml TMB in 24mM citric acid/52mM 

27 sodium phosphate buffer pH 5.2). The reaction was 

28 stopped with 100jil/well 12.5% H 2 S0 4 and read at 450nm. 

29 The nucleotide sequence of an ELISA positive clone 

30 insert and DBD junction was checked by DNA sequencing 

31 using oligonucleotides M13FOR (SEQ ID No 26) (see 
3 2 Example 1) and ORSEQBAK (SEQ ID No 36) (5'- 

3 3 TGTTGAAACACAAGCGCCAG-3 ' ) . 
34 

3 5 A fifty-fold concentrated stock of C-terminal N- 

3 6 cadherin PDCP particles was prepared by growing the un- 
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1 infected TGI clone in 1ml 2xTY culture broth 

2 supplemented with 1% glucose and 100jug/ml ampicillin 

3 for five hours at 37 °C, shaking at 200rpm and infecting 

4 with 10 8 kanamycin resistance units (kru) M13K07 helper 

5 phage at 37°C for 30 minutes without shaking, then for 

6 30 minutes with shaking at 200rpm. Infected bacteria 

7 were transferred to 2 0ml 2xTY broth supplemented with 

8 25^g/ml kanamycin, 100/ig/ml ampicillin, and 20/iM IPTG, 

9 then incubated overnight at 30 °C, shaking at 200rpm. 
20 Bacteria were pelleted at 4000rpm for 20 minutes in 
11 50ml Falcon tubes, and 4ml 2.5M NaCl/20% PEG 6000 was 
22 added to 2 0ml of PDCP supernatant, mixed vigorously and 
13 incubated on ice for 1 hour to precipitate particles. 
14 

25 The particles were pelleted at llOOOrpm for 30 minutes 

16 in 50ml Oakridge tubes at 4°C in a Sorvall RC5B 

17 centrifuge, then resuspended in PBS buffer after 

18 removing all traces of PEG/NaCl with a pipette, then 

19 bacterial debris removed by a 5 minute 13 500rpm spin in 

20 a microcentrifuge. The supernatant was filtered through 

21 a 0.45,um polysulfone syringe filter. The concentrated 

22 stock was two-fold serially diluted and used in ELISA 

23 against plates coated with anti-pan-cadherin antibody 

24 as described above (see Figure 6) . 
25 

2 6 This example demonstrates the principle of C-terminal 

27 display using PDCPs, that C-terminal DBD-peptide fusion 

28 PDCPs can be made which can be detected in ELISA, and 

29 the possibility that oligo dT primed cDNA libraries may 

3 0 be displayed using this method. 
31 

32 Example 7 . Display of in vivo biotinylated c-terminal 

33 domain of human propionyl CoA carboxylase on the 

34 surface of a PDCP 

35 

36 Example 6 shows that the C-terminal domain of human N- 



SUBST1TUTE SHEET (RULE 26) 



WO 99/11785 



45 



PCT/GB98/02630 



1 cadherin can be expressed on the surface of a PDCP as a 

2 C-terminal fusion with the DBD. Here it is shown that 

3 the C-terminal domain of another human protein 

4 propionyl CoA carboxylase alpha chain (Genbank 

5 accession number: X14608) can similarly be displayed, 

6 suggesting that this methodology may be general. 
7 

8 The alpha sub-unit of propionyl CoA carboxylase alpha 

9 chain (PCC) contains 703 amino acids and is normally 

10 biotinylated at position 669. It is demonstrated that 

11 the PCC peptide displayed on the PDCP is biotinylated, 

12 as has been shown to occur when the protein is 

13 expressed in bacterial cells (Leon-Del-Rio & Gravel; 

14 1994, J. Biol. Chem. 37, 22964-22968). 
15 

16 The 0.8kb human cDNA fragment of PCC alpha encoding the 

17 C-terminal 95 amino acids, 3'- untranslated region and 

18 polyA tail (NotI site present at the 3 * -end of the 

19 polyA tail) was amplified and cloned into pDM16 from 

20 approximately 20ng pDM7-PCC#C with 25pmol of each 

21 oligonucleotide M13FOR (SEQ ID No 26) and CDNPCRBAK1 

22 (SEQ ID No 29) as described in Example 6. 
23 

24 Clones containing inserts were identified by ELISA as 

25 described in Example 6, except that streptavidin was 

26 coated onto the ELISA plate at 250ng/well, in place of 

27 the anti-cadherin antibody. The nucleotide sequence of 

28 an ELISA positive clone insert and DBD junction was 

29 checked by DNA sequencing using oligonucleotides M13FOR 

30 (SEQ ID No 26) and ORSEQBAK (SEQ ID No 3 6) (see above). 

31 A fifty-fold concentrated stock of C-terminal PCC PDCP 
3 2 particles was prepared and tested in ELISA against 

3 3 streptavidin as described in Example 6 (see Figure 7) . 
34 

3 5 This example shows not only that the peptide can be 

36 displayed as a C-terminal fusion on a PDCP, but also 
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l that in vivo modified peptides can be displayed. 
2 

3 Example 8. Construction of a human scFv PDCP display 

4 library 

5 

6 This example describes the generation of a human 

7 antibody library of scFvs made from an un- immunised 

8 human. The overall strategy for the PCR assembly of 

9 scFv fragments is similar to that employed by Marks, J. 

10 D. et al. 1991, J. Mol. Biol. 222: 581-597. The 

11 antibody gene oligonucleotides used to construct the 

12 library are derived from the Marke et al., paper and 

13 from sequence data extracted from the Kabat database 

14 (Kabat, E. A. et al., Sequences of Proteins of 

15 Immunological Interest. 4 th edition. U.S. Department of 

16 Health and Human Services. 1987) . The three linker 

17 oligonucleotides are described by Zhou et al. (1994, 

18 Nucleic Acids Res., 22: 888-889), all oligonucleotides 

19 used are detailed in Table 1. 
20 

21 First, mRNA was isolated from peripheral blood 

22 lymphocytes and cDNA prepared for four repertoires of 

23 antibody genes IgD, IgM, Ig/c and IgX, using four 

24 separate cDNA synthesis primers. VH genes were 

25 amplified from IgD and IgM primed cDNA, and VL genes 

26 were amplified from Ig* and IgX primed cDNA. A portion 

27 of each set of amplified heavy chain or light chain DNA 

28 was then spliced with a separate piece of linker DNA 

29 encoding the 15 amino acids (Gly 4 Ser) 3 (Huston, J. S. 

30 et al. 1989, Gene, 77: 61). The 3 ' -end of the VH PCR 

31 products and the 5 ' -end of the VL PCR products overlap 

32 the linker sequence as a result of incorporating linker 

33 sequence in the JH, Va and VX family primer sets (Table 
3 4 1) . Each VH- linker or linker-VL DNA product was then 

35 spliced with either VH or VL DNA to produce the primary 

3 6 scFv product in a VH-linker-VL configuration. This scFv 
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1 product was then amplified and cloned into pDM12 as a 

2 Sfil-NotI fragment, electroporated into TGI and a 

3 concentrated PDCP stock prepared. 

4 

5 mRNA isolation and cDNA synthesis. 

6 Human lymphocyte mRNA was purified as described in 

7 Example 2 . Separate cDNA reactions were performed with 

8 IGDCDNAFOR (SEQ ID No 37), IGMCDNAFOR (SEQ ID No 38), 

9 IGKCDNAFOR (SEQ ID No 39) and IGXCDNAFOR (SEQ ID No 40) 

10 oligonucleotides. 50pmol of each primer was added to 

11 approximately 5fj.q of mRNA in 20^1 of nuclease free 

12 water and heated to 70 °C for 5 minutes and cooled 

13 rapidly on ice, then made up to a final reaction volume 

14 of 100/il containing 50mM Tris pH 8.3, 75mM KCl, 3mM 

15 MgCl 2 , lOmM DTT, 0 . 5mM dNTPs , and 2000 units of 

16 Superscript II reverse transcriptase (Life 

17 Technologies, Paisley, Scotland, U.K.). The reactions 

18 were incubated at 37 °C for two hours, then heated to 

19 95 °C for 5 minutes. 
20 

21 Primary PCRs. 

22 For the primary PCR amplifications separate 

23 amplifications were set up for each family specific 

24 primer with either an equimolar mixture of the JHFOR 

25 primer set (SEQ ID Nos 41 to 44) for IgM and IgD cDNA, 

2 6 or with SCFV/cFOR (SEQ ID No 51) or SCFVXFOR (SEQ ID No 

27 52) for IgK or IgX cDNA respectively e.g. VH1BAK and 

28 JHFOR set; V«2BAK (SEQ ID No 54) and SCFVkFOR (SEQ ID 

29 No 51) ; V\3aBAK (SEQ ID No 66) and SCFVXFOR (SEQ ID No 

3 0 52) etc. Thus, for IgM, IgD and Ig/c cDNA six separate 

31 reactions were set up, and seven for IgX cDNA. A 50/ul 

32 reaction mixture was prepared containing 2^1 cDNA, 

3 3 2 5pmol of the appropriate FOR and BAK primers, O.lmM 

34 dNTPs, 2.5 units Taqplus DNA polymerase, and lx High 

35 Salt PCR reaction buffer (20mM Tris-HCl pH 9.2, 60mM 

36 KCl, 2mM MgCl 2 ) (Stratagene Ltd, Cambridge, U.K.). 
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1 Reactions were amplified on a Techne Progene thermal 

2 cycler for 3 0 cycles of 94°C, 1 minute; 60°C, 1 minute; 

3 72°C, 2 minutes, followed by 10 minutes at 72°C. Fifty 

4 microlitres of all 25 reaction products were 

5 electrophoresed on an agarose gel, excised and products 

6 purified from the gel using a Geneclean II kit 

7 according to the manufacturers instructions (BiolOl, La 

8 Jolla, California, U.S.A.) . All sets of IgD, IgM, IgK 

9 or IgX reaction products were pooled to produce VH or 

10 VL DNA sets for each of the four repertoires. These 

11 were then adjusted to approximately 20ng//xl. 
12 

13 Preparation of linker. 

14 Linker product was prepared from eight 100/il reactions 

15 containing 5ng LINKAMP3T (SEQ ID No 76) template 

16 oligonucleotide, 50pmol of LINKAMP3 (SEQ ID No 74) and 

17 LINKAMP5 (SEQ ID No 75) primers, 0 . ImM dNTPs, 2.5 units 

18 Taqplus DNA polymerase, and lx High Salt PCR reaction 

19 buffer (20mM Tris-HCl pH 9.2, 60mM KCl, 2mM MgCl 2 ) 

20 (Stratagene Ltd, Cambridge, U.K.). Reactions were 

21 amplified on a Techne Progene thermal cycler for 30 

22 cycles of 94°C, 1 minute; 60 C C, 1 minute; 72°C, 1 

23 minute, followed by 10 minutes at 72°C. All reaction 

24 product was electrophoresed on a 2% low melting point 

25 agarose gel, excised and products purified from the gel 

2 6 using a Mermaid kit according to the manufacturers 

27 instructions (BiolOl, La Jolla, California, U.S.A.) and 

28 adjusted to 5ng/Ml- 
29 

3 0 First stage linking. 

31 Four linking reactions were prepared for each 

3 2 repertoire using 2 0ng of VH or VL DNA with 5ng of 

3 3 Linker DNA in 100^1 reactions containing (for IgM or 

34 IgD VH) 50pmol of LINKAMPFOR and VH1-6BAK set, or, 

3 5 50pmol LINKAMPBAK and either SCFV/cFOR (Ig/c) or SCFVXFOR 

36 (IgX), O.lmM dNTPs, 2.5 units Tag DNA polymerase, and 
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1 lx PCR reaction buffer (lOmM Tris-HCl pH 9.0, 5mM KCX, 

2 0.01% Triton X*-100, 1.5mM MgCl 2 ) (Promega Ltd, 

3 Southampton, U.K.). Reactions were amplified on a 

4 Techne Progene thermal cycler for 30 cycles of 94°C, 1 

5 minute; 60°C, 1 minute; 72°C, 2 minutes, followed by 10 

6 minutes at 72 °C. Reaction products were electrophoresed 

7 on an agarose gel, excised and products purified from 

8 the gel using a Geneclean II kit according to the 

9 manufacturers instructions (BiolOl, La Jolla, 
10 California, U.S.A.) and adjusted to 20ng//il. 
11 

12 Final linking and reamplif ication. 

13 To prepare the final scFv DNA products, five 100/xl 

14 reactions were performed for VH-LINKER plus VL DNA, 

15 and, five 100/il reactions were performed for VH plus 

16 LINKER— VL DNA for each of the four final repertoires 

17 (IgM VH-VK, VH-VX; IgD VH-VK, VH-VX) as described in 

18 step (d) above using 20ng of each component DNA as 

19 template. Reaction products were electrophoresed on an 

20 agarose gel, excised and products purified from the gel 

21 using a Geneclean II kit according to the manufacturers 

22 instructions (BiolOl, La Jolla, California, U.S.A.) and 

23 adjusted to 20ng/jil- Each of the four repertoires was 

24 then re-amplified in a 100/il reaction volume containing 
2 5 2ng of each linked product, with 50pmol VHBAK1-6 (SEQ 

26 ID Nos 53 to 58) and either the JKFOR (SEQ ID Nos 66 to 

27 70) or JXFOR (SEQ ID Nos 71 to 73) primer sets, in the 

2 8 presence of 0 . ImM dNTPs, 2.5 units Taq DNA polymerase, 

29 and lx PCR reaction buffer (lOmM Tris-HCl pH 9.0, 5mM 

30 KC1, 0.01% Triton x"-100, 1.5mM MgCl,) (Promega Ltd, 

31 Southampton, U.K.). Thirty reactions were performed per 

3 2 repertoire to generate enough DNA for cloning. 

3 3 Reactions were amplified on a Techne Progene thermal 

34 cycler for 25 cycles of 94 °C, 1 minute; 65°C, 1 minute; 

3 5 72 °C, 2 minutes, followed by 10 minutes at 72 °C. 

3 6 Reaction products were phenol-chloroform extracted, 
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1 ethanol precipitated, vacuum dried and re-suspended in 

2 80/il nuclease free water, 
3 

4 cloning into pDM12. 

5 Each of the four repertoires was Sfil-NotI digested, 

6 and electrophoresed on an agarose gel, excised and 

7 products purified from the gel using a Geneclean II kit 

8 according to the manufacturers instructions <Biol01, La 

9 Jolla, California, U.S.A.)- Each of the four 

10 repertoires was ligated overnight at 16°C in 140^1 with 

11 10/ig of Sfil-NotI cut pDM12 prepared as in Example 2, 

12 and 12 units of T4 DNA ligase (Life Technologies, 

13 Paisley, Scotland, U.K.). After incubation the 

14 ligations were adjusted to 200^1 with nuclease free 

15 water, and DNA precipitated with l/xl 20mg/ml glycogen, 

16 100^1 7 . 5M ammonium acetate and 900/il ice-cold (-20°C) 

17 absolute ethanol, vortex mixed and spun at I3,000rpm 

18 for 20 minutes in a microfuge to pellet DNA. The 

19 pellets were washed with 500/xl ice-cold 70% ethanol by 

20 centrifugation at 13,000rpm for 2 minutes, then vacuum 

21 dried and re-suspended in 10/xl DEPC-treated water. 1^1 

22 aliquots of each repertoire was electroporated into 

23 80^1 E. coli (TGI) . Cells were grown in 1ml SOC medium 

24 per cuvette used for 1 hour at 37°C, and plated onto 

25 2xTY agar plates supplemented with 1% glucose and 

26 lOOptg/ml ampicillin. 10~\ 10' 5 and 10" 6 dilutions of the 

27 electroporated bacteria were also plated to assess 

28 library size. Colonies were allowed to grow overnight 

29 at 3 0°C. Cloning into Sfil-NotI digested pDM12 yielded 

30 an IgM-K/X repertoire of 1.16X10 9 clones, and an IgD-/t/X 

31 repertoire of 1.21X10 9 clones. 
32 

3 3 preparation of PDCP stock. 

34 Separate PDCP stocks were prepared for each repertoire 

35 library. The bacteria were then scraped off the plates 
3 6 into 3 0ml 2xTY broth supplemented with 2 0% glycerol, 1% 
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1 glucose and lOOjug/ml ampicillin. 3ml was added to a 

2 50ml 2xTY culture broth supplemented with 1% glucose 

3 and 100/xg/ml ampicillin and infected with 10 11 kanamycin 

4 resistance units (kru) M13K07 helper phage at 37 "C for 

5 30 minutes without shaking, then for 3 0 minutes with 

6 shaking at 200rpm. Infected bacteria were transferred 

7 to 500ml 2xTY broth supplemented with 25jxg/ml 

8 kanamycin, 100/tg/ml ampicillin, and 20/xM IPTG, then 

9 incubated overnight at 30°C, shaking at 200rpnu 

10 Bacteria were pelleted at 4Q00rpm for 20 minutes in 

11 50ml Falcon tubes, and 80ml 2.5M NaCl/20% PEG 6000 was 

12 added to 400ml of particle supernatant, mixed 

13 vigorously and incubated on ice for 1 hour to 

14 precipitate PDCP particles. Particles were pelleted at 

15 llOOOrpm for 30 minutes in 250ml Oakridge tubes at 4°C 

16 in a Sorvall RC5B centrifuge, then resuspended in 40ml 

17 water and 8ml 2 . 5M NaCl/20% PEG 6000 added to 

18 reprecipitate particles, then incubated on ice for 2 0 

19 minutes. Particles were again pelleted at llOOOrpm for 

20 30 minutes in 50ml Oakridge tubes at 4°C in a Sorvall 

21 RC5B centrifuge, then resuspended in 5ml PBS buffer, 

22 after removing all traces of PEG/NaCl with a pipette. 

23 Bacterial debris was removed by a 5 minute I3500rpm 

2 4 spin in a microcentrifuge. The supernatant was filtered 

25 through a 0.45ptm polysulfone syringe filter, adjusted 

26 to 20% glycerol and stored at -70 °C. 
27 

28 Example 9. Isolation of binding activity from a N- 

29 terminal display PDCP library of human scPvs 

30 

31 The ability to select binding activities to a target of 

32 interest from a human antibody library is important due 

3 3 to the possibility of generating therapeutic human 

34 antibodies. In addition, such libraries allow the 

35 isolation of antibodies to targets which cannot be used 
3 6 for traditional methods of antibody generation due to 
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1 toxicity, low immunogenicity or ethical considerations. 

2 In this example we demonstrate the isolation of 

3 specific binding activities against a peptide antigen 

4 from a PDCP library of scFvs from an un-immunised 

5 human . 
6 

7 The generation of the library, used for the isolation 

8 of binding activities in this example, is described in 

9 Example 8. 
20 

21 Substance P is an eleven amino acid neuropeptide 

12 involved in inflammatory and pain responses in vivo. It 

23 has also been implicated in a variety of disorders such 

14 as psoriasis and asthma amongst others (Misery, L. 

15 1997, Br. J. Dertmatol., 137: 843-850; Maggi, C. A. 

16 1997, Regul. Pept. 70: 75-90; Choi, D. C. & Kwon, 0. J. , 

17 1998, Curr. Opin. Pulm. Med., 4: 16-24). Human 

18 antibodies which neutralise this peptide may therefore 

19 have some therapeutic potential. As this peptide is too 
2 0 small to coat efficiently on a tube, as described in 

21 Example 3, selection of binding activities was 

22 performed in-solution, using N-terminal biotinylated 
2 3 substance P and capturing bound PDCP particles on 

24 streptavidin-coated magnetic beads. 
25 

2 6 Enrichment for substance P binding PDCP particles. 

27 An aliquot of approximately 10 13 a.r.u. IgM and IgD scFv 

28 library stock was mixed with l^tg biotinylated substance 

29 P in 800jul 4% BSA/0.1% Tween 20/PBS, and allowed to 

3 0 bind for two hours at ambient temperature. Bound PDCPs 

31 were then captured onto 1ml of BSA blocked streptavidin 

32 coated magnetic beads for 10 minutes at ambient 

33 temperature. The beads were captured to the side of the 

34 tube with a magnet (Promega) , and unbound material 

35 discarded. The beads were washed eight times with 1ml 

36 PBS/0.1% Tween 20/ lOjug/ml streptavidin, then two times 
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1 with lml of PBS by magnetic capture and removal of wash 

2 buffer. After the final wash bound PDCPs were eluted 

3 with lml of freshly prepared 0.1M triethylamine for 10 

4 minutes, the beads were captured, and eluted particles 

5 transferred to 0.5ml 1M Tris-HCl pH 7.4. Neutralised 

6 particles were added to 10ml log phase TGI E. coli 

7 bacteria and incubated at 37 °C without shaking for 30 

8 minutes, then with shaking at 200rpm for 30 minutes. 

9 10' 3 , 10" 4 St 10" 5 dilutions of the infected culture were 

10 prepared to estimate the number of particles recovered, 

11 and the remainder was spun at 4000 rpm for 10 minutes, 

12 and the pellet resuspended in 300,ul 2xTY medium by 

13 vortex mixing. Bacteria were plated onto 2xTY agar 

14 plates supplemented with 1% glucose and 100/zg/ml 

15 ampicillin. Colonies were allowed to grow overnight at 

16 3 0°C. A 100-fold concentrated PDCP stock was prepared 

17 from a 200ml amplified culture of these bacteria as 

18 described above, and 0.5ml used in as second round of 

19 selection with 500ng biotinylated substance P. For this 

20 round 100/zg/ml streptavidin was included in the wash 

21 buffer. 
22 

23 ELISA identification of binding clones. 

24 Binding clones were identified by ELISA of 96 

25 individual PDCP cultures prepared as described in 

26 Example 3 from colonies recovered after the second 

27 round of selection. A Dynatech Immulon 4 ELISA plate 

28 was coated with 200ng/well streptavidin in 100^1 /well 

29 PBS for 1 hour at 3 7 °C. The plate was washed 

30 3x2 00/il/well PBS and incubated with lOng/well 

31 biotinylated substance P in 100/il /well PBS for 30 

32 minutes at 37 °c The plate was washed 3x2 00/xl/well PBS 

33 and blocked for l hour at 37°C with 200/il/well 2% 

34 Marvel non-fat milk powder/PBS and then washed 

35 2x200^1/well PBS. 50jtil PDCP culture supernatant was 

36 added to each well containing 50/zl/well 4% Marvel/PBS, 
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1 and allowed to bind for 1 hour at ambient temperature. 

2 The plate was washed three times with 200fil/well 

3 PBS/0.1% Tween 20, then three times with 200/xl/well 

4 PBS . Bound PDCPs were detected with 100^1/well, 1:5000 

5 diluted anti-M13-HRP conjugate (Pharmacia) in 2% 

6 Marvel/ PBS for 1 hour at ambient temperature and the 

7 plate washed six times as above. The plate was 

8 developed for 10 minutes at ambient temperature with 

9 100ptl/well freshly prepared TMB (3, 3', 5, 5'- 

10 Tetramethylbenzidine) substrate buffer (0.005% H 2 0 2 , 

11 O.img/ml TMB in 24mM citric acid/52mM sodium phosphate 

12 buffer pH 5.2). The reaction was stopped with 

13 100/xl/well 12.5% H 2 S0 4 and read at 450nm. Out of 96 

14 clones tested, 10 gave signals greater than twice 

15 background (background = 0.05). 
16 

17 Characterization of a binding clone. 

18 A 50-fold concentrated PDCP stock was prepared from a 

19 100ml amplified culture of a single ELISA positive 

20 clone as described above. 10^1 per well of this stock 

21 was tested in ELISA as described above for binding to 

22 streptavidin , streptavidin-biotinylated-substance P and 

2 3 streptavidin-biotinylated-CGRP (N-terminal 

24 biotinylated) . Binding was only observed in 

25 streptavidin-biotinylated-substance P coated wells 

26 indicating that binding was specific. In addition, 

27 binding to streptavidin-biotinylated substance P was 

28 completely inhibited by incubating the PDCP with 1/zg/ml 

29 free substance P (see Figure 8) . The scFv VH (SEQ ID 

30 Nos 15 and 16) and VL (SEQ ID Nos 17 and 18) DNA and 

31 amino acid sequence was determined by DNA sequencing 

32 with oligonucleotides M13REV (SEQ ID No27) and ORSEQFOR 

3 3 (SEQ ID No 36) and is shown in Figure 9. 
34 

3 5 The results indicate that target binding activities can 

3 6 be isolated from PDCP display libraries of human scFv 
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1 fragments . 
2 

3 Example 10 

4 In another example the invention provides methods for 

5 screening a DNA library whose members require more than 

6 one chain for activity, as required by, for example, 

7 antibody Fab fragments for ligand binding. To increase 

8 the affinity of an antibody of known heavy and light 

9 chain sequence, libraries of unknown light chains 

10 co-expressed with a known heavy chain are screened for 

11 higher affinity antibodies. The known heavy chain 

12 antibody DNA sequence is joined to a nucleotide 

13 sequence encoding a oestrogen receptor DNA binding 

14 domain in a phage vector which does not contain the 

15 oestrogen receptor HRE sequence. The antibody DNA 

16 sequence for the known heavy chain (VH and CHI) gene is 

17 inserted in the 5' region of the oestrogen receptor DBD 

18 DNA , behind an appropriate promoter and translation 

19 sequences and a sequence encoding a signal peptide 
2 0 leader directing transport of the downstream fusion 

21 protein to the periplasmic space. The library of 

22 unknown light chains (VL and CL) is expressed 

23 separately from a phagemid expression vector which also 
2 4 contains the oestrogen receptor HRE sequence. Thus when 
25 both heavy and light chains are expressed in the same 

2 6 host cell, following infection with the phage 

27 containing the heavy chain-DBD fusion, the light chain 

28 phagemid vector is preferentially packaged into mature 

29 phage particles as single stranded DNA, which is bound 

3 0 by the heavy chain-DBD fusion protein during the 
31 packaging process. The light chain proteins are 

3 2 transported to the periplasm where they assemble with 

3 3 the heavy chain that is fused to the DBD protein as it 

34 exits the cell on the PDCP. In this example the DBD 

3 5 fusion protein and the HRE DNA sequences are not 

3 6 encoded on the same vector, the unknown peptide 
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1 sequences are present on the same vector as the HRE 

2 sequence. Peptide display carrier packages (PDCP) which 

3 encode the protein of interest can then be selected by 

4 means of a ligand specific for the antibody. 
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Table 1 (i) Oligonucleotide primers used for human scPv library construction 



cDNA synthesis primers 



IgMCDNAFOR TGGAAGAGGCACGTTCTTTTCTTT 

I gDCDNAFOR CTCCTTCTTACTCTTGCTGGCGGT 

I gK CDNAFOR AGACTCTCCCCTGTTGAAGCTCTT 

I gACDNAFOR TGAAG ATTCTGTAGGGGCCACTGTCTT 



.JHFOR primers 



JH1-2FOR TGAACCGCCTCCACCTGAGGAGACGGTGACCAGGGTGCC 

JH3 FOR TGAACCGCCTCCACCTGAAGAGACGGTGACCATTGTCCC 

JH4 - 5 FOR TGAACCGCCTCCACCTGAGGAGACGGTGACCAGGGTTCC 

JH6FOR TGAACCGCCTCCACCTGAGGAGACGGTGACCGTGGTCCC 



VH f amilyBAKprimers 



VH1BAK TTTTTGGCCCAGCCGGCCATGGCCCAGGTGCAGCTGGTGCAGTCTGG 

VH2BAK TTTTTGGCCCAGCCGGCCATGGCCCAGGTCAACTTAAGGGAGTCTGG 

VH3BAK TTTTTGGCCCAGCCGGCCATGGCCGAGGTGCAGCTGGTGGAGTCTGG 

VH4BAK TTTTTGGCCCAGCCGGCCATGGCCCAGGTGCAGCTGCAGGAGTCGGG 

VH5BAK TTTTTGGCCCAGCCGGCCATGGCCGAGGTGCAGCTGTTGCAGTCTGC 

VH6BAK TTTTTGGCCCAGCCGGCCATGGCCCAGGTACAGCTGCAGCAGTCAGG 



Light chain FOR primers 

SCFVKFOR TTATTCGCGGCCGCCTAAACAGAGGCAGTTCCAGATTTC 
SCFVAFOR GTCACTTGCGGCCGCCTACAGTGTGGCCTTGTTGGCTTG 



VK family BAK primers 



VK1BAK TCTGGCGGTGGCGGATCGGACATCCAGATGACCCAGTCTCC 

VK2BAK TCTGGCGGTGGCGG ATCGGATGTTGTGATG ACTC AGTCTCC 

VK3BAK TCTGGCGGTGGCGGATCGGAAATTGTGTTGACGCAGTCTCC 

VK4BAK TCTGGCGGTGGCGGATCGGACATCGTGATGACCCAGTCTCC 

VK5BAK TCTGGCGGTGGCGGATCGGAAACGACACTCACGCAGTCTCC 

VK6 BAK TCTGGCGGTGGCGGATCGGAAATTGTGCTGACTCAGTCTCC 
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J"K FOR primers 



XKIFOR TTCTCGTGCGGCCGCCTAACGTTTGATTTCCACCTTGGTCCC 

XK2FOR TTCTCGTGCGGCCGCCTAACGTTTGATCTCCAGCTTGGTCCC 

JK3FOR TTCTCGTGCGGCCGCCTAACGTTTGATATCCACTTTGGTCCC 

J"K4FOR TTCTCGTGCGGCCGCCTAACGTTTGATCTCCACCTTGGTCCC 

JK5FOR TTCTCGTGCGGCCGCCTAACGTTTAATCTCCAGTCGTGTCCC 



VA family BAK primers 

VAlBAK TCTGGCGGTGGCGGATCGCAGTCTGTGTTGACGCAGCCGCC 
VA2 BAK TCTGGCGGTGGCGGATCGCAGTCTGCCCTGACTCAGCCTGC 



Table 1 (ii) Oligonucleotide primers used for human scFv library construction 



VA3aBAK TCTGGCGGTGGCGGATCGTCCTATGTGCTGACTCAGCCACC 

VA3bBAK TCTGGCGGTGGCGGATCGTCTTCTGAGCTGACTCAGGACCC 

VA4BAK TCTGGCGGTGGCGGATCGCACGTTATACTGACTCAACCGCC 

VA5BAK TCTGGCGGTGGCGGATCGCAGGCTGTGCTCACTCAGCCGTC 

VA6BAK TCTGGCGGTGGCGGATCGAATTTTATGCTGACTCAGCCCCA 



JA primers 



JA1FOR TTCTCGTGCGGCCGCCTAACCTAGGACGGTGACCTTGGTCCC 
JA2-3FOR TTCTCGTGCGGCCGCCTAACCTAGGACGGTCAGCTTGGTCCC 
JA4-5FOR TTCTCGTGCGGCCGCCTAACCTAAAACGGTGAGCTGGGTCCC 



Linker primers 



LINKAMP3 CGATCCGCCACCGCCAGA 
LINKAMP5 GTCTCCT C AGGTGG AGGC 

LINKAMP3T CGATCCGCCACCGCCAGAGCCACCTCCGCCTGAACCGCCTCCACCTGAGGAGAC 
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l Claims 

2 

3 l. A peptide display carrier package (PDCP) , said 

4 package comprising a recombinant polynucleotide- 

5 chimeric protein complex wherein the chimeric 

6 protein has a nucleotide binding portion and a 

7 target peptide portion, wherein said recombinant 
s polynucleotide comprises a nucleotide sequence 

9 motif which is specifically bound by said 

io nucleotide binding portion, and wherein at least 

n the chimeric protein-encoding portion of the 

12 recombinant polynucleotide not bound by the 

13 chimeric protein nucleotide binding portion is 

14 protected by a binding moeity. 

15 

16 2. A peptide display carrier package (PDCP) as claimed 

17 in Claim 1, wherein said chimeric protein-encoding 
is portion of the recombinant polynucleotide not bound 

19 by the chimeric protein nucleotide binding portion 

20 is protected by a non-sequence-specific protein. 

21 

22 3. A peptide display carrier package (PDCP) as claimed 

23 in Claim 2, wherein said non-sequence-specific 

24 protein is a viral coat protein. 

25 

26 4. A peptide display carrier package (PDCP) as claimed 

27 in any one of Claims 1 to 3 , wherein said target 

28 peptide portion is displayed externally on the 

29 package. 
30 

31 5. A peptide display carrier package (PDCP) as claimed 

32 in any one of Claims 1 to 4 wherein said 

33 recombinant polynucleotide includes a linker 

34 sequence between the nucleotide sequence encoding 
3 5 the nucleotide binding portion and the nucleotide 
36 sequence encoding the target peptide portion. 
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1 6. A peptide display carrier package (PDCP) as claimed 

2 in any one of Claims 1 to 5 wherein said 

3 recombinant polynucleotide has two or more 

4 nucleotide sequence motifs each of which can be 

5 bound by the nucleotide binding portion of the 

6 chimeric protein. 

7 

8 7. A peptide display carrier package (PDCP) as claimed 

9 in any one of Claims l to 6 wherein said nucleotide 

10 binding portion is a DNA binding domain of an 

11 oestrogen or progesterone receptor. 

12 

13 8. A peptide display carrier package (PDCP) as claimed 

14 in any one of Claims l to 7 wherein said 

15 recombinant polynucleotide is bound to said 

16 chimeric protein as single stranded DNA. 

17 

18 9. A peptide display carrier package (PDCP) as claimed 

19 in any one of Claims 1 to 8 wherein said target 

20 peptide portion is located at the N and/ or C 

21 terminal of the chimeric protein. 
22 

23 10. A peptide display carrier package (PDCP) as claimed 

24 in any one of Claims 1 to 9 which is produced in a 

25 host cell transformed with said recombinant 

26 polynucleotide and extruded therefrom without lysis 

27 of the host cell. 

28 

29 11. A recombinant polynucleotide comprising a 

30 nucleotide sequence encoding a chimeric protein 

31 having a nucleotide binding portion operably linked 

32 to a target peptide portion, wherein said 

33 polynucleotide includes a specific nucleotide 

34 sequence motif which is bound by the nucleotide 

35 binding portion of said chimeric protein and 

36 further encoding a non-sequence-specific nucleotide 
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l binding protein. 

2 

3 12. A recombinant polynucleotide as claimed in Claim 11 

4 wherein said non-sequence-specific nucleotide 

5 binding protein is a viral coat protein. 

6 

7 13 . A recombinant polynucleotide as claimed in either 

8 one of Claims 11 and 12 which includes a linker 

9 sequence between the nucleotide sequence encoding 

10 the nucleotide binding portion and the nucleotide 

11 sequence encoding the target peptide portion. 
12 

13 14. A recombinant polynucleotide as claimed in any one 

14 of Claims 11 to 13 which has two or more nucleotide 
is sequence motifs each of which can be bound by the 
is nucleotide binding portion of the chimeric protein. 

17 

is 15. A recombinant polynucleotide as claimed in any one 

19 of Claims 11 to 14 wherein said nucleotide binding 

20 portion is a DNA binding domain of an oestrogen or 

21 progesterone receptor. 

22 

23 16. A recombinant polynucleotide as claimed in any one 

24 of Claims 11 to 15 wherein said recombinant 

25 polynucleotide is bound to said chimeric protein as 

26 single stranded DNA. 

27 

28 17. A genetic construct or set of genetic constructs 

29 which collectively comprises a polynucleotide 

30 having a sequence which includes: 

31 i) a sequence encoding a nucleotide binding 

32 portion able to recognise and bind to a 

33 specific sequence motif; 

34 ii) the sequence motif recognised and bound by the 

35 nucleotide binding portion encoded by (i); 

36 iii) a restriction enzyme site which permits 
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1 insertion of a polynucleotide, said site being 

2 designed to operably link said polynucleotide 

3 to the sequence encoding the nucleotide 

4 binding portion so that expression of the 

5 operably linked polynucleotide sequences 

6 yields a chimeric protein; and 

7 iv) a sequence encoding a nucleotide binding 

8 protein which binds non-specif ically to naked 

9 polynucleotide. 

10 

11 18. A genetic construct or set of genetic constructs as 

12 claimed in Claim 17 wherein a linker sequence is 

13 located between the nucleotide sequence encoding 

14 the nucleotide binding portion and the site for 

15 insertion of the polynucleotide. 

16 

17 19. A genetic construct or set of genetic constructs as 

18 claimed in either one of Claims 17 and IS which 

19 includes a vector pDM12 , pDM14 or pDM16, deposited 

20 at NCIMB under Nos 40970, 40971 and 40972 

21 respectively. 

22 

23 20. A method of constructing a genetic library, said 

24 method comprising: 
25 

26 a) constructing multiple copies of a recombinant 

27 vector comprising a polynucleotide sequence 

28 which encodes a nucleotide binding portion 

29 able to recognise and bind to a specific 

30 sequence motif; 
31 

32 b) operably linking each said vector to a 

33 polynucleotide encoding a target polypeptide, 

34 such that expression of said operably linked 

35 vector results in expression of a chimeric 

36 protein comprising said target peptide and 
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1 said nucleotide binding portions; wherein said 

2 multiple copies of said operably linked 

3 vectors collectively express a library of 

4 target peptide portions; 

5 

6 c) transforming host cells with the vectors of 

7 step b) ; 

8 

9 d) culturing the host cells of step c) under 

20 conditions suitable for expression of said 
11 chimeric protein; 

12 

23 e) providing a recombinant polynucleotide 

14 comprising the nucleotide sequence motif 

is specifically recognised by the nucleotide 

16 binding portion and exposing this 

27 polynucleotide to the chimeric protein of step 

is d) to yield a polynucleotide-chimeric protein 

29 complex; and 

20 

21 f) causing production of a non-sequence-specific 

22 moiety able to bind to the non-protected 

23 portion of the polynucleotide encoding the 

24 chimeric protein to form a peptide display 

25 carrier package. 

26 

27 21. A method of screening a genetic library, said 

28 method comprising: 

29 

30 a) exposing the polynucleotide members of said 

31 library to multiple copies of a genetic 

32 construct comprising a nucleotide sequence 

33 encoding a nucleotide binding portion able to 

34 recognise and bind to a specific sequence 

35 motif, under conditions suitable for the 

36 polynucleotides of said library each to be 
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1 individually ligated into one copy of said 

2 genetic construct, to create a library of 

3 recombinant polynucleotides; 

4 

5 b) exposing said recombinant polynucleotides to a 

6 population of host cells, under conditions 

7 suitable for transformation of said host cells 

8 by said recombinant polynucleotides; 
9 

20 c) selecting for transformed host cells; 
11 

22 d) exposing said transformed host cells to 

13 conditions suitable for expression of said 

24 recombinant polynucleotide to yield a chimeric 

25 protein ; and 

16 

17 e) providing a recombinant polynucleotide 

28 comprising the nucleotide sequence motif 

29 specifically recognised by the nucleotide 

20 binding portion and exposing this 

21 polynucleotide to the chimeric protein of step 

22 d) to yield a polynucleotide-chimeric protein 

23 complex; 

24 

25 f) protecting any exposed portions of the 

26 polynucleotide in the complex of step e) to 

27 form a peptide display carrier package; and 
28 

29 g) screening said peptide display carrier package 

30 to select only those packages displaying a 

31 target peptide portion having the 

32 characteristics required. 

33 

34 22. A method as claimed in Claim 21 wherein the peptide 

35 display package carrier is extruded from the host 

36 cell without lysis thereof. 
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23. A polynucleotide comprising a nucleotide sequence 

substantially as set out in SEQ ID No. 15 or SEQ ID 
No . 17 . 
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Figure 1 1/9 

1 pelB 

MET LIS TZR W WJ PRO T2R 21k MA ALA SLY M 
AAGCTT (nTa^TTCraTTTCmaGaQCTglTM ATG AAA TAC CTA TTG CCT ACG GCA GCC GCT GGA TTG 
TTCGAi\CGT;£mmT^AOTCCTCTGTQGTATT TAG TTT ATG GAT AAC GGA TGC CGT CGG CGA CCT AAC 

ffift dill 

77 #1 ■ JW jfctl 

I2U 1^ ALA A& i& M AM &U VAL GL\ ^ 

TTA TTA CTC G CG GCC GAG CCG GCC ATG GCC GAG GTG CAA CTG CAG TAA TAG GCG GCC GCA 

AAT AAT GAG CSC CGG GTC GGC CGG TAC CGG GTC CAC GTC GAC GTC ATT ATC CGC CGG CGT 



137 



GLI GLI GLT GLT SER 
GGG GGA GGA GGG TCC 
CCC CCT CCT GCC AGG 



MET GLU SER ALA LIS GLU THR ARG TYR CIS ALA VAL CYS ASN ASP 
ATG GAA TCT GCC AAG GAG ACT CGC TAC TGT GCA GTG TGC AAT GAC 
TAG CTT AGA CGG TTC CTC TGA GCG ATG ACA CGT CAC ACG TTA CTG 



137 " 



257- 



317 



TO AU SER GLI TYR HIS TYR GLI VAL TRP SER CIS GLU GLI CIS LYS ALA PEE PHE LIS 
TAT GCT TCA GGC TAC CAT TAT GGA GTC TGG TCC TGT GAG GGC TGC AAG GCC TTC TTC AAG 
ATA CGA AGT CCG ATG GTA ATA CCT CAG ACC AGG ACA CTC CCG ACG TTC CGG AAG AAG TTC 

ARG SER ILZ GIN GLI HIS ASN AS? TYR MET CIS PRO ALA THR ASN GLK CYS THR ILZ ASP 
AGA AGT ATT CAA GGA CAT AAC GAC TAT ATG TGT CCA GCC ACC AAC CAG TGC ACC ATT GAT 
TCT TCA TAA GTT CCT GTA TTG CTG ATA TAC ACA GGT CGG TGG TTG GTC ACG TGG TM CTA 



"Oestrogen receptor DBD" 



LIS ASN ARG ARG LYS SER CIS GLN ALA CYS ARG LEU ARG LIS CIS TYR GLU VAL GLI MET 
AAA AAC AGG AGG AAG AGC TGC CAG GCC TGC CGG CTC CGT AAA TGC TAC GAA GTG GGA ATG 
TTT TTG TCC TCC TTC TCG ACG GTC CGG ACG GCC GAG GCA TTT ACG ATG CTT CAC CCT TAC 



377- 



JET LIS GLI GLI ILS ARG LIS ASP ARG ARG GLI GLI ARG MET LEU LIS HIS LIS ARG GLU 
ATG AAA GGT GGG ATA CGA AAA GAC CGA AGA GGA GGG AGA ATG TTG AAA CAC AAG CGC CAG 
TAC TTT CCA CCC TAT GCT TTT CTG GCT TCT CCT CCC TCT TAC AAC TTT GTG TTC GCG GTC 



437 ' 1 

ARG ASP ASP GLI GLU GLI ARG GLI GLU VAL GLI SER "* *** 
AGA GAT GAT GGG GAG GGC AGG GGT GAA GTG GGG TCT TGA TAA 
TCT CTA CTA CCC CTC CCG TCC CCA CTT CAC CCC AGA ACT ATT 



TCAGGTCAGAGTGACCTGAGC^^ iAATTC 
AGTCCAGTTOCTmCfflTTrTATTGTGTAAGTC [TTAAG 
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Figure 3 3/9 

Human Igk constant region 

RRTVAAPSV 
AAACGAACTGTGGCTGCACCATOTC 

Clone#2 

M Aig P T I R P G Q G T R L D I I R I V A A P S V 

ATGGCCCAGCCCACCACGCGTCCGGGCCAAGGGACACGACTGGACAfTAAACGAACTGTGGCTGCACCATCTGTC 
Clone 13 

M A^Q S H H A S G G G T X V E I R R T V A A P S V 
ATGGCCC^GTCCCACCACGCGTCCGGCGGAGGGACCMGGTGGAGATCAAACGAACTGTGGCTGCACCATCTGTC 

Human Igk constant region 

FIFPPSDEQLRSGTASVVCLLNNFI 
TTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTAT 
Clone #2 

FIFPPSDEQLRSGTASVVCLLNNFI 
TTCATCTTCCCGCCATCTGATGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTAT 
Clone 13 

FIFPPSDEQLRSGTASVVCLLNNFI 
TTCATCTTCCCGCCATCTMGAGCAGTTGAAATCTGGAACTGCCTCTGTTGTGTGCCTGCTGAATAACTTCTAT 
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Figure 4 4/9 pe iB 

Ufa dm he xrs m lev lev mo mmm 

AAOTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATAA ATG AAA TAC CTA TTG CCT ACG GCA GCC 
TTCGAACGTA CGTTTAAGAT AAAGTTCCTC TGTCAGTATT TAC TTT ATG GAT AAC GGA TGC CGT CGG 

58 Sfil ftfl 

ALU GLY LEV LEU LEU LEV AM ALA GIN PRO ALA MET ALA GLU VA1 GLN LEU GLN *** *** 
GCT GGA TTG TTA TTA CTC GC G GCC CAG CCG GCC ATG GCC GAG GTG CAA CTG CAG TAA TAG 
CGA CCT AAC AAT AAT GAG CGC CGG GTC GGC CGG TAC CGG CTC CAC GTT GAC GTC ATT ATC 

128 M 

GLY GLI GLY GLY SERlMET GLU SER ALA LYS GLU THR ARG TYR CYS ALA VAL 
GGG GGA GGA GGG TCC ATG GAA TCT GCC AAG GAG ACT CGC TAC TGT GCA GTG 
CCC CCT CCT CCC AGG[ TAC CTT AGA CGG TTC CTC TGA GCG ATG ACA CGT CAC 



ALA ALA ALA 
GCG GCC GCA 
CGC CGG CGT 
188 



CYS 


ASN 


ASP 


TYR 


ALA 


SER 


GLY 


TYR HIS 


TYR 


GLY VAL TRP SER CYS GLU GLY CYS LYS ALA 


mnn 


AAT 


GAC 


TAT 


GCT 


TCA 


GGC 


TAC CAT 


TAT 


GGA GTC TGG TCC TGT GAG GGC TGC AAG GCC 


ACG 


TTA 


CTG 


ATA 


CGA 


AGT 


CCG 


ATG GTA 


ATA 


CCT CAG ACC AGG ACA CTC CCG ACG TTC CGG 



248 



PHE 


PHE 


LYS 


ARG 


SER 


ILE 


GLN 


GLY HIS 


ASN 


TTC 


TTC 


AAG 


AGA 


AGT 


ATT 


CAA 


GGA CAT 


AAC 


AAG 


AAG 


TTC 


TCT 


TCA 


TAA 


GTT 


CCT GTA 


TTG 


308 


















THR 


ILE 


ASP 


LYS 


ASN 


ARG 


ARG 


LYS SER 


CYS 


ACC 


ATT 


GAT 


AAA 


AAC 


AGG 


AGG 


AAG AGC 


TGC 


TGG 


TAA 


CTA 


TTT 


TTG 


TCC 


TCC 


TTC TCG 


ACG 


368 


















VAL 


GLY 


MET 


MET 


LYS 


GLY 


GLY 


ILE ARG 


LYS 


GTG 


GGA 


ATG 


ATG 


AAA 


GGT 


GGG 


ATA CGA 


AAA 


CAC 


CCT 


TAC 


TAC 


TTT 


CCA 


CCC 


TAT GCT 


TTT 


428 


















LYS 


ARG 


GLN 


ARG 


ASP 


ASP 


GLY 


GLU GLY 


ARG 


AAG 


CGC 


CAG 


AGA 


GAT 


GAT 


GGG 


GAG GGC 


AGG 



HEE1 



TTC GCG GTC TCT CTA CTA CCC CTC CCG TCC CCA CTT_CAC CCC AGA ACT ATT 

491 HEE1 WI 



TCAGGTCAGAGl 
AGTCCAGTCTC2 



3ACCTGAGCTAAAATAACACATTCAG 
ITGGACTCGATTTTATTGTGTAAGTC 



GTCGAC 



CAGCTG 



TTGGGTCAGTCTGACCGGGACAAAGTTAATGTAACCTC 
AACCCAGTCAGACTGGCCCTGTTTCAATTACATTGGAG 



GAATTC 



CTTAAG 
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Figure 5 5/9 1B 

iSinBill *® T mm leu m mmm 

AAGCTT GCAT GCAAATTCIA TTTCAAGGAG ACAGTCATAA ATG AAA TAC CTA TTG CCT ACG GCA GCC 
TTCGAACGTA CGTTTAAGAT AAAGTTCCTC TGTCAGTATT TAC TTT ATG GAT AAC GGA TGC CGT CGG 



ALA GLY LEU LEU LEU LEU ALA ALA GLN PRO ALA 
GCT GGA TTG TTA TTA CTC GCG GCC CAG CCG GCA 
CGA CCT AAC AAT AAT GAG CGC CGG GTC GGC CGT 
128 

TER ARG TYR CYS ALA VAL CYS ASN ASP TYR ALA 
ACT CGC TAC TGT GCA GTG TGC AAT GAC TAT GCT 
TGA GCG ATG ACA CGT CAC ACG TTA CTG ATA CGA 
188 

CYS GLU GLI CYS LYS ALA PHE PHE LYS ARG SER 
TGT GAG GGC TGC AAG GCC TTC TTC AAG AGA AGT 
ACA CTC CCG ACG TTC CGG AAG AAG TTC TCT TCA 
248 

PRO ALA THR ASN GLN CYS THR HE ASP LYS ASN 
CCA GCC ACC AAC CAG TGC ACC ATT GAT AAA AAC 
GGT CGG TGG TTG GTC ACG TGG TAA CTA TTT TTG 
308 

LEU ARG LYS CYS TYR GLU VAL GLY MET MET LYS 
CTC CGT AAA TGC TAC GAA GTG GGA ATG ATG AAA 
GAG GCA TTT ACG ATG CTT CAC CCT TAC TAC TTT 
368 

GLY ARG MET LEU LYS HIS LYS ARG GLN ARG ASP 
GGG AGA ATG TTG AAA CAC AAG CGC CAG AGA GAT 
CCC TCT TAC AAC TTT GTG TTC GCG GTC TCT CIA 

«28 $/n 



SER 
TCT 
AGA 
489, 



GLY GLY GLY GLY SER 
GGG GGA GGA GGG TCG 



AGTCGAC ETG GGTCAGTCTG ACCGGGACAA AGTTAATGTA 
TCAGCTG &C CCAGTCAGAC TGGCCCTGTT TCAATTACAT 



MET ALA GLU MET GLU SER ALA LYS GLU 
ATG GCC GAG ATG GAA TCT GCC AAG GAG 
TAC CGG CTC TAC CTT AGA CGG TTC CTC 

SER GLY TYR HIS TYR GLY VAL TRP SER 
TCA GGC TAC CAT TAT GGA GTC TGG TCC 
AGT CCG ATG GTA ATA CCT CAG ACC AGG 

HE GLN GLY HIS ASN ASP TYR MET CYS 
ATT CAA GGA CAT AAC GAC TAT ATG TGT 
TAA GTT CCT GTA TTG CTG ATA TAC ACA 

ARG ARG LYS SER CYS GLN ALA CYS ARG 
AGG AGG AAG AGC TGC CAG GCC TGC CGG 
TCC TCC TTC TCG ACG GTC CGG ACG GCC 

GLY GLY ILE ARG LYS ASP ARG ARG GLY 
GGT GGG ATA CGA AAA GAC CGA AGA GGA 
CCA CCC TAT GCT TTT CTG GCT TCT CCT 

ASP GLY GLU GLY ARG GLY GLU VAL GLY 
GAT GGG GAG GGC AGG GGT GAA GTG GGG 
CTA CCC CTC CCG TCC CCA CTT CAC CCC 

M Ml 

LEU GLN LEU ALA ALA ALA TER 

CTG CAG CTG GCG GCC GCA TAACTGATTG 

CGG GTC GGC CGG GAG GAC GTC GAC CGC CGG CGT ATTGACTAAC 

GAATTC 



ALA GLN PRO ALA LEU 
GCC CAG CCG GCC CTC 



ACCTC 
TGGAG 



CTTAAG 
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QVQLQQSGGGVVQPGRSL 

CAGGTACAGCTGCAGCAGTCAGGGGGAGGCGTGGTCCAGCCTGGGAGGTCCCTG 

GTCCATGTCGACGTCGTCAGTCCCCCTCCGCACCAGGTCGGACCCTCCAGGGAC 

RLSCAASGFPFS T Y G M H W 
AGACTCTCCTGTGCAGCCTCGGGATTCCCCTTTAGTACTTZ^GGC^rGCACTGG 
TCTGAGAGGACACGTCGGAGCCCTAAGGGGAAAT CATGAATACCGTACGTGACC 

RQAVPGKGLEWVA V I S Y D 

CGCCAGGCTGTCCCAGGCAAGGGGCTGGAGTGGGTGGCAGTTATATCAIATGSJT 

GQ3GTCCGACAGGX5TCCGTTCCCCGA^ 

G S NKYYAD SVKG R F T I S R 

DNSKNTLYLQMNSLRAED 

GACAATTCCAAGAACACGTTGTATCTGCAAATGAACAGCCTGAGAGCTGAGGAC 

CTGTTAAGGTTCTTGTGCAACATAGACGTTTACTTGTCGGACTCTCGACTCCTG 

TAVYYCAR DIiDPTRYSSG 
ACGGCTGTGTATTACTGTGCGAGAGATTTAGAC^ 

TGCCGACACATAATGACACGCTCTCTAAZ^CTGGGGTGGTCCATZ^CGTCACCG 

W D T D Y W GQGHLVTVSS 

^GGGACACfGACTACTGGGGCCAGGGGCACCTGGTCACTGTCTCCTCA 

ACCCTGTGACTGATGACCCCGGTCCCCGTGGACCAGTGACAGAGGAGT 

LIGHT CHAIN ^ ^ o 

ETTLTQSPGTLSLSPGER 

GAAACGACACTCACGCAGTCTCCAGGCACCCTGTCTTTGTCTCCGGGGGAAAGA 

CTTTGCTGTGAGTGCGTCAGAGGTCCGTGGGACAGAAACAGAGGCCCCCTTTCT 

A T L S C RASQNIGSS SLAW 

GCCACCCTCTCCTGCAGGGCCAGTCAGAATATTGGCAGCAGCTCCT^ 

CGGTGGGAGAGGACGTCCCGGTCAGTCTTATA^ 

YQQKPGQAPRLLIY G A S T 

TACCAACAGAAACCTGGCCAGGCTCCCAGGCTCCTCATCTATGGTGCATCGACC 

ATGGTTGTCTTTGGACCGGTCCGAGGGTCCGAGGAGTAGATACCACGTAGGTGG 

RATGFSGSGSGTQFTLTI 

AGGGCCACfGGTTTCAGTGGCAGTGGGTCAGGGACACAATTCACTCTCACCATC 

TCCCGGTGACCAAAGTCACCGTCACCCAGTCCCTGTGTTAAGTGAGAGTGGTAG 

I PARS SLQS EDFAVYYCQ 

ATCCCAGCCAGGAGCAGCCTGCAGTCTGAAGATTTTGCAGTTTATTACTGTCAG 

TAGGGTCGGTCCTCGTCGGACGTCAGACTTCTAAAACGTCAAATAATGACAGTC 

Q YNFWPFT F GPGTKLEIK 

CAGTATAATTTCTGGCCATTC^TTTTGGCCCTGGGACCAAGCTGGAGATCAAA 

GTCATATTAAAGACCGGTAAGTGAAAACCGGGA 

R 

CGT 
GCA 
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SEQUENCE LISTING 



(1> GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Rowett Research Services Limited 

(B) STREET: Greenburn 

(C) CITY: Buckburn ABERDEEN 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP) : AB21 9SB 

(ii) TITLE OF INVENTION: Chimeric binding peccjice library scre*r.i.-.g 
method 

(iii) NUMBER OF SEQUENCES: 76 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: I3M PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release sl.0, Vars.ir. =1.30 (EPOS 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
{A) LENGTH: 3S base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synchetic DMA" 

(iii) HYPOTHETICAL: MO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
TCAGGTCAGA GTGACCTGAG CTAAAATAAC A.CATTCAG 3 3 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS.: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AGTCCAGTCT CACTGGACTC GATTTTATTG TGTAAGTC 
(2) INFORMATION FOR SEQ ID NO: 3: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 521 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 41. .475 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

AAGCTT GCAT GCAAATTCTA TTTCAAGGAG AC AGT CAT AA ATG AAA TAC CTA TTG 

Met Lys Tyr Leu Leu 
1 5 



CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCG GCC CAG CCG GCC ATG 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
10 15 20 

GCC CAA. GTG CAG CTG CAG TAA TAG GCG GCC GCA GGG GGA GGA GGG TCC 
Ala Gin Val Gin Leu Gin * * Ala Ala Ala Gly Gly Gly Gly Ser 
25 30 35 

ATG GAA TCT GCC AAG GAG ACT CGC TAC TGT GCA GTG TGC AAT GAC TAT 
Met Glu Ser Ala Lys Glu Thr Arg Tyr Cys Ala Val Cys Asn Asp Tyr 
40 45 50 

GCT TCA GGC TAC CAT TAT GGA GTC TGG TCC TGT GAG GGC TGC AAG GCC 
Ala Ser Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala 
55 60 65 

TTC TTC AAG AGA AGT ATT CAA GGA CAT AAC GAC TAT ATG TGT CCA GCC 
Phe Phe Lys Arg Ser He Gin Gly His Asn Asp Tyr Met Cys Pro Ala 
70 75 80 85 

ACC AAC CAG TGC ACC ATT GAT AAA AAC AGG AGG AAG AGC TGC CAG GCC 
Thr Asn Gin Cys Thr He Asp Lys Asn Arg Arg Lys Ser Cys Gin Ala 
90 95 100 

TGC CGG- CTC CGT AAA TGC TAC GAA GTG GGA ATG ATG AAA GGT GGG ATA 
Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly Met Met Lys Gly Gly He 
10S no us 

CGA AAA GAC CGA AGA GGA GGG AGA ATG TTG AAA CAC AAG CGC CAG AGA 
Arg Lys Asp Arg Arg Gly Gly Arg Met Leu Lys His Lys Arg Gin Arg 
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120 125 130 

GAT GAT GGG GAG GGC AGG GGT GAA GTb GGG TCT TGA TAA7CAGGTC 
Asp Asp Gly Glu Gly Arg Gly Glu Val Gly Ser * 
135 140 145 

AGAGTGACCT GAGCTAAAAT AACACAT T C A GAATTC 



(2) INFORMATION FOR SEQ ID NO: 4: 

(if SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 145 amino acids 
(Bi TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
15 10 15 

Ala Gin Pro Ala Met Ala Gin Val Gin Leu Gin * ' Ala Ala Ala 
20 25 30 

Gly Gly Gly Gly Ser Met Glu Ser Ala Lys Giu Thr Arg Tyr Cys Ala 
35 40 45 

Val Cys Asn Asd Tvr Ala Ser Gly Tvr Kis Tyr Gly Val Trp Ser Cys 
50 55 60 

Glu Gly Cys Lvs Ala Phe Phe Lys Arg Ser lie Gin Giv His Asn Asp 
65 " 70 75 80 

Tyr Met Cys Pro Ala Thr Asn Gin Cys Thr lie Asp Lys Asn Arg Arg 
85 90 95 

Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cvs Tyr Glu Val Gly Met 
100 105 110 

Met Lys Gly Gly lie Arg Lys Asd Arg Arg Giv Glv Arg Met Leu Lys 
115 12*0 125 

His Lvs Arg Gin Arg Asp Asp Gly Glu Gly Arg Gly Glu Val Gly Ser 
130 135 140 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: MO 
(iv} ANTI-SENSE; MO 
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(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION : 1 . .102 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

AAA CGA ACT GTG GCT GCA CCA TCT GTC TTC ATC TTC CCG CCA TCT GAT 
Lys Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asp 
150 155 160 

GAG CAG TTG AAA TCT GGA ACT GCC TCT GTT GTG TGC CTG CTG AAT AAC 
Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 
16S 170 175 

TTC TAT 
Phe Tyr 



(2) INFORMATION FOR SEQ ID NO: 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 amino acids 

(B) TYPE: airtino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE' TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Lys Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asp 
15 10 15 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 
20 25 30 

Phe Tyr 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO ■ 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. . 150 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
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ATG GCC CAG CCC ACC ACG CGT CCG GGC CAA GGG ACA CGA CTG GAC ATT 
Mec Ala Gin Pro Thr Thr Arg Pro Giy Gin Gly Thr Arg Leu As? lie 
35 40 45 50 

AAA CGA ACT GTG GCT GCA CCA TCT GTC TTC ATC TTC CCG CCA TCT GAT 
Lys Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 
55 60 65 

GAG CAG TTG AAA TCT GGA ACT GCC TCT GTT GTG TGC CTG CTG AAT AAC 
Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 
70 75 SO 

TTC TAT 
Phe Tyr 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 50 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: S: 

Met Ala Gin Pro Thr Thr Arg Pro Gly Gin Gly Thr Arg Leu As? lie 
1 5 10 15 

Lvs Arg Thr Val Ala Ala Pro Ser Val Phe He Phe Pro Pro Ser Asp 
20 25 30 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 
35 40 45 

Phe Tyr 
50 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE : 

(A) NAME/ KEY: CDS 
(8) LOCATION: 1 . . 150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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ATG GCC CAG TCC CAC CAC GCG TCC GGC GGA GGG ACC AAG GTG GAG ATC 
Met Ala Gin Ser His His Ala Ser Glv Glv Glv Thr Lys Val Glu lie 
55 60 " 55 

AAA CGA ACT GTG GCT GCA CCA TCT GTC TTC ATC TTC CCG CCA TCT GAT 
Lys Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser Asn 
70 75 80 

GAG CAG TTG AAA TCT GGA ACT GCC TCT GTT GTG TGC CTG CTG AAT AAC 
Glu Gin leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 



TTC TAT 
Phe Tyr 
100 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Gin Ser His His Ala Ser Glv Gly Glv Thr Lvs Val Glu lie 
15 10 15 

Lys Arg Thr Val Ala Ala Pro Ser Val Phe lie Phe Pro Pro Ser As? 
20 25 30 

Glu Gin Leu Lys Ser Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn 



Phe Tyr 
50 

(2) INFORMATION FOR SEQ ID NO: 11: 

U) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 566 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 41. .4' 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATAA ATG AAA TAC CTA TTG 
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7 Met Lys Tyr Leu Leu 

55 

CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCG GCC CAG CCG GCC ATG 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Gin Pro Ala Met 
60 65 70 

GCC GAG GTG CAA CTG CAG TAA TAG GCG GCC GCA GGG GGA GGA GGG TCC 
Ala Glu Val Gin Leu Gin * * Ala Ala Ala Gly Gly Gly Gly Ser 
75 80 35 

ATG GAA TCT GCC AAG GAG ACT CGC TAC TGT GCA GTG TGC AAT GAC TAT 
Met Glu Ser Ala Lys Glu Thr Arg Tyr Cys Ala Val Cys Asn Asp Tyr 
90 95 100 

GCT TCA GGC TAC CAT TAT GGA GTC TGG TCC TGT GAG GGC TGC AAG GCC 
Ala Ser Gly Tvr His Tyr Gly Val Trp Ser Cys Glu Gly Cys Lys Ala 
105 * HO 115 

TTC TTC AAG AGA AGT ATT CAA GGA CAT AAC GAC TAT ATG TGT CCA GCC 
Phe Phe Lys Arg Ser He Gin Gly His Asn Asp Tyr Met Cys Pro Ala 
120 125 130 135 

ACC AAC CAG TGC ACC ATT GAT AAA AAC AGG AGG AAG AGC TGC CAG GCC 
Thr Asn Gin Cys Thr He Asp Lys Asn Arg Arg Lys Ser Cys Gin Ala 
140 145 150 

TGC CGG CTC CGT AAA TGC TAC GAA GTG GGA ATG ATG AAA GGT GGG ATA 
Cys Arg Leu Arg Lys Cys Tyr Glu Val Gly Met Met Lys Gly Gly He 
155 160 165 

CGA AAA GAC CGA AGA GGA GGG AGA ATG TTG AAA CAC AAG CGC CAG AGA 
Arg Lys Asp Arg Arg Gly Gly Arg Met Leu Lys His Lys Arg Gin Arg 
170 175 180 

GAT GAT GGG GAG GGC AGG GGT GAA GTG GGG TCT TGA TAATCAGGTC 
Asp Asp Gly Glu Gly Arg Gly Glu Val Gly Ser * 
185 190 195 

AGAGTGACCT GAGCTAAAAT AACACATTCA GGTCGACTTG GGTCAGTCTG ACCGGGACAA 

AGTTAATGTA ACCTCGAATT C 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 145 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
1 5 10 15 

Ala Gin Pro Ala Met Ala Glu Val Gin Leu Gin * * Ala Ala Ala 
20 25 30 

Gly Gly Gly Gly Ser Met Glu Ser Ala Lys Glu Thr Arg Tyr Cys Ala 
35 40 45 
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Val Cys Asn Asp Tyr Ala Ser Gly Tyr His Tyr Gly Vai T:d Ser Cys 
50 55 60 

Glu Gly Cvs Lys Ala Phe Phe Lys Arg Ser lie Gin Glv His Asn Aso 
65 " 70 75 8*0 

Tyr Met Cvs Pro Ala Thr Asn Gin Cys Thr lie Asp Lvs Asn Arg Arg 
85 90 " 95 

Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val Glv Met 
100 105 110 

Met Lvs Gly Glv lie Arg Lys Asa Arg Arg Gly Glv Arg Met Leu Lvs 
115 * 120 125 

His Lys Arg Gin Arg Asp Asp Gly Glu Gly Arg Gly Glu Val Gly Ser 
130 135 140 



145 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 41. .481 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

AAGCTTGCAT GCAAATTCTA TTTCAAGGAG ACAGTCATAA ATG AAA TAC CTA TTG 55 

Met Lvs Tvr Leu Leu 
150 

CCT ACG GCA GCC GCT GGA TTG TTA TTA CTC GCG GCC CAG CCG GCA ATG 103 
Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala Ala Glr. Pro Ala Met 
155 150 155 

GCC GAG ATG GAA TCT GCC AAG GAG ACT CGC TAC TGT GCA GTG TGC AAT 151 
Ala Glu Met Glu Ser Ala Lvs Glu Thr Arg Tyr Cys A.ia Vai Cys Asn 
170 175 130 

GAC TAT GCT TCA GGC TAC CAT TAT GGA GTC TGG TCC TGT GAG GGC TGC 199 
Asd Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp Ser Cys Glu Gly Cys 
185 190 195 

AAG GCC TTC TTC AAG AGA AGT ATT CAA GGA CAT AAC GAC TAT ATG TGT 247 
Lys Ala Phe Phe Lys Arg Ser lie Gin Gly His Asn Asp Tyr Met Cys 
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200 205 210 

CCA GCC ACC AAC CAG TGC ACC ATT GAT AAA AAC AGG AGG AAG AGC TGC 295 
Pro Ala Thr Asn Gin Cys Thr lie Asp Lvs Asn Arg Arg Lvs Ser Cvs 
215 220 225 . * 230 

CAG GCC TGC CGG CTC CGT AAA TGC TAC GAA GTG GGA ATG ATG AAA GGT 343 
Gin Ala Cvs Arg Leu Arg Lys Cys Tyr Giu Val Gly Met Met Lys Gly 
235 240 245 

GGG ATA CGA AAA GAC CGA AGA GGA GGG AGA ATG TTG AAA CAC AAG CGC 391 
Gly lie Arg Lys Asp Arg Arg Gly Gly Arg Met Leu Lys His Lvs Arg 
250 255 260 

CAG AGA GAT GAT GGG GAG GGC AGG GGT GAA GTG GGG TCT GGG GGA GGA 439 
Gin Arg Asp Asp Gly Glu Gly Arg Gly Glu Val Gly Ser Glv Gly Gly 
265 270 275 

GGG TCG GCC CAG CCG GCC CTC CTG CAG CTG GCG GCC GCA TAA 431 
Gly Ser Ala Gin Pro Ala Leu Leu Gin Leu Ala Ala Ala * 
280 285 290 

CTGATTGAGT CGACTTGGGT CAGTCTGACC GGGACAAAGT TAATGTAACC TCGAATTC 539 



(2) INFORMATION FOR SEQ ID MO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 147 amino acids 
{ B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Lvs Tvr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Leu Ala 
1*5 10 15 

Ala Gin Pro Ala Met Ala Glu Met Glu Ser Ala Lys Giu Thr Arg Tyr 
20 25 30 

Cvs Ala Val Cys Asn Asp Tyr Ala Ser Gly Tyr His Tyr Gly Val Trp 
35 40 45 

Ser Cys Glu Gly Cys Lys Ala Phe Phe Lvs Arg Ser lie Gin Gly Kis 
50 55 60 

Asn Asd Tvr Met Cys Pro Ala Thr Asn Gin Cys Thr lie Aso Lvs Asn 
65 70 75 80 

Arg Arg Lys Ser Cys Gin Ala Cys Arg Leu Arg Lys Cys Tyr Glu Val 



Glv Met Met Lys Gly Gly lie Arg Lys Asp Arg Arg Gly Gly Arg Met 
100 105 * " 110 

Leu Lys His Lys Arg Gin Arg Aso Asd Glv Glu Gly Arg Glv Giu Val 
115 120 125 

Gly Ser Gly Gly Gly Gly Ser Ala Gin Pro Ala Leu Leu Gin Leu Ala 
130 135 140 
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(2) INFORMATION FOR SEQ ID NO; 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 
(C> STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(ix> FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1 . -372 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

CAG GTA CAG CTG CAG CAG TCA GGG GGA GGC GTS GTC CAG CCT GGG AGG 4 8 

Gin Val Gin Leu Gin Gin Ser Gly Glv Glv Vai Val Gin Pro Glv Arg 
150 155 ' 150 . 

TCC CTG AGA CTC TCC TGT GCA GCC TC3 GGA TTC CCC TTT AGT ACT TAT 96 
Ser Leu Arg Leu Ser Cys Ala Ala Ser Glv Phe Pro Phe Ser Thr Tyr 
165 170 175 

GGC ATG CAC TGG CGC CAG GCT GTC CCA GGC AAG GGG CTG GAG TGG GTG 144 
Gly' Met Kis Tro Arg Gin Ala Val Pro Giy Lys Gly Leu Glu Trp Val 
180 185 190 195 

GCA GTT ATA TCA TAT GAT GGA AGT AAT AAA TAC TAC GCA GAC TCC GTG 192 
Ala Val lie Ser Tyr Asp Gly Ser Asn Lys Tyr Tyr Ala As? Ser Val 
200 205 210 

AAG GGC CGA TTC ACC ATC TCC AGA GAC AAT TCC AAG AAC ACG TT3 TAT 240 
Lvs Gly Arg Phe Thr lie Ser Arg Asd Asn Ser Lys Asn Thr Leu Tyr 
215 220 225 

CTG CAA ATG AAC AGC CTG AGA GCT GAG GAC ACG GCT GTG TAT TAC TGT 288 
Leu Gin Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr T'/r Cys 
230 235 240 

GCG AGA (SAT TTA GAC CCC ACC AGG TAT AGC AGT GGC TGG GAC ACT GAC 336 
Ala Arg Asd Leu Asp Pro Thr Arg Tyr Ser Ser Gly Trp Aso Thr Asd 
245 ' 250. 255 

TAC TGG GGC CAG GGG CAC CTG GTC ACT GTC TCC TCA 372 
Tyr Trp Gly Gin Gly His Leu Val Thr Val Ser Ser 
260 265 270 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 124 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Gin Val Gin Leu Gin Gin Ser Gly Gly Gly Val Val Gin Pro Gly Arg 
1 5 10 15 

Ser Leu Arg Leu Ser Cys Ala Ala Ser Gly Phe Pro Phe Ser Thr Tyr 
20 25 30 

Glv Met His Trp Arg Gin Ala Val Pro Gly Lys Gly Leu Glu Trp Val 
35 40 45 

Ala Val He Ser Tyr Asp Gly Ser Asn Lys Tyr Tyr Ala Asp Ser Val 
50 55 60 

Lys Gly Arg Phe Thr He Ser Arg Asp Asn Ser Lys Asn Thr Leu Tyr 
S5 70 "75 80 

Leu Gin Met Asn Ser Leu Arg Ala Glu Asp Thr .Ala Val Tyr Tyr Cys 
85 90 95 

Ala Arg Asn Leu Aso Pro Thr Arg Tyr Ser Ser Gly Tr? As? Thr Asp 
" 100 * 105 HO 

Tvr Trp Gly Gin Gly His Leu Val Thr Val Ser Ser 
115 120 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. .327 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GAA ACG ACA CTC ACG CAG TCT CCA GGC ACC CTG TCT TTG TCT CCG GGG 
Glu Thr Thr Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly 
12 5 130 135 140 

GAA AGA GCC ACC CTC TCC TGC AGG GCC AGT CAG AAT ATT GGC AGC AGC 
Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Asn He Gly Ser Ser 
145 150 15= 

TCC TTA GCC TGG TAC CAA CAG AAA CCT GGC CAG GCT CCC AGG CTC CTC 
Ser Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu 
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160 165 170 

ATC TAT GGT GCA TCC ACC AGG GCC ACT GGT ATC CCA GCC AGG TTC AGT 192 
lie Tyr Gly Ala Ser Thr Arg Ala Thr Gly lie Pro Ala Arg Phe Ser 
175 180 185 

GGC AGT GGG TCA GGG ACA CAA TTC ACT CTC ACC ATC AGC AGC CTG CAG 240 
Gly Ser Gly Ser Gly Thr Gin Phe Thr Leu Thr lie Ser Ser Leu Gin 
190 195 200 

TCT GAA GAT TTT GCA GTT TAT TAC TGT CAG CAG TAT AAT TTC TGG CCA 288 
Ser Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Asn Phe Trp Pro 
205 210 215 220 

TTC ACT TTT GGC CCT GGG ACC AAG CTG GAG ATC AAA CGT 327 
Phe Thr Phe Gly Pro Gly Thr Lys Leu Glu lie Lys Arg 
225 230 



(2} INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 109 amino acids 
(B> TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

Glu Thr Thr Leu Thr Gin Ser Pro Gly Thr Leu Ser Leu Ser Pro Gly 
15 10 15 

Glu Arg Ala Thr Leu Ser Cys Arg Ala Ser Gin Asn lie Glv Ser Ser 
20 25 30 

Ser Leu Ala Trp Tyr Gin Gin Lys Pro Gly Gin Ala Pro Arg Leu Leu 
35 40 45 

lie Tvr Gly Ala Ser Thr Arg Ala Thr Gly lie Pro Ala Arg Phe Ser 
50 55 60 

Glv Ser Gly Ser Gly Thr Gin Phe Thr Leu Thr He Ser Ser Leu Gin 
65 70 75 80 

Ser Glu Asp Phe Ala Val Tyr Tyr Cys Gin Gin Tyr Asn Phe Trp Pro 
85 90 95 

Phe Thr Phe Gly Pro Gly Thr Lys Leu Glu He Lys Arg 
100 105 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (RULE 26) 



WO 99/11785 



PCT/GB98/02630 



(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
TTTTCTGCAG TAATAGGCGG CCGCAGGGGG AGGAGGGTCC ATCGAAGGTC GCGAAGCAGA 
GACTGTTGAA AG 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 75 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
TTTTGAATTC TTATTAACCA CCGAACTGCG GGTGACGCCA AGCGCTTGCG GCCGTTAAGA 
CTCCTTATTA CGCAG 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DMA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

AAAAGCGGCC GCACTGGCCT GAGAGANNNN NN 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 16 base pairs 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TCGACCCACG CGTCCG 16 
(2) INFORMATION FOR SEQ ID NO: 23: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

{ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGGTGCCGAG GC 12 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

' (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
AAAAGAATTC TGAATGTGTT ATTTTAGCTC AGGTCACTCT GACCTGATTA TCAAGACCCC 60 
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ACTTCACCCC CT 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
AAAAGCGGCC GCAGGGGGAG GAGGGTCCAT GGAATCTGCC AAGGAG 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii} MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

( iii ) HYPOTHETI CAL : NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GTAAAACGAC GGCCAGT 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GGATAACAAT TTCACACAGG 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AAAGCGGCCG CACTGGCCTG AGAGA 
{2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

{iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
AAAAGGCCCA GCCGGCCATG GCCCAGCCCA CCACGCGTCC G 
{2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
AAAAGGCCCA GCCGGCCATG GCCCAGTCCC ACCACGCGTC CG 4 2 

(2] INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
AAAAGGCCCA GCCGGCCATG GCCCAGTACC CACCACGCGT CCG 43 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: MO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
AAAAGAATTC GAGGTTACAT TAACTTTGTT CCGGTCAGAC TGACCCAAGT CGACCTGAAT 
GTGTTATTTT AG 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 
CCTTGGCAGA TTCCATCTCG GCCATTGCCG GC 
(2) INFORMATION FOR SEQ ID NO: 34: 

{i} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 
CCGGCAATGG CCGAGATGGA ATCTGCCAAG G 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
TTTTGTCGAC TCAATCAGTT ATGCGGCCGC CAGCTGCAGG AGGGCCGGCT GGGCCGACCC 60 
TCCTCCCCCA GACCCCACTT CACCCC 86 
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(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
(C! STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii> HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
TGTTGAAACA CAAGCGCCAG 
(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
{iv) ANTI -SENSE: MO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
TGGAAGAGGC ACGTTCTTTT CTTT 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DMA" 

I iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 
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CTCCTTCTTA CTCTTGCTGG CGGT 24 

(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 4 base pairs 
(B5 TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
AGACTCTCCC CTGTTGAAGC TCTT 24 
(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



<xi) SEQUENCE DESCRIPTION: SEQ ID MO: 40: 
TGAAGATTCT GTAGGGGCCA CTGTCTT 27 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acLd 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
TGAACCGCCT CCACCTGAGG AGACGGTGAC CAGGGTGCC 39 
(2) INFORMATION FOR SEQ ID NO: 42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii> MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TGAACCGCCT CCACCTGAAG AGACGGTGAC CATTGTCCC 39 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
TGAACCGCCT CCACCTGAGG AGACGGTGAC CAGGGTTCC 39 
(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "synthetic DNA" 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 
TGAACCGCCT CCACCTGAGG AGACGGTGAC CGTGGTCCC 
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

TTTTTGGCCC AGCCGGCCAT GGCCCAGGTG CAGCTGGTGC AGTCTGG 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 47 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 
TTTTTGGCCC AGCCGGCCAT GGCCCAGGTC AACTTAAGGG AGTCTGG 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA' 
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(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
TTTTTGGCCC AGCCGGCCAT GGCCGAGGTG CAGCTGGTGG AGTCTGG 47 
(2) INFORMATION FOR SEQ ID NO: 48: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
TTTTTGGCCC AGCCGGCCAT GGCCCAGGTG CAGCTGCAGG AGTCGGG 
(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
TTTTTGGCCC AGCCGGCCAT GGCCGAGGTG CAGCTGTTGC AGTCTGC 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
TTTTTGGCCC AGCCGGCCAT GGCCCAGGTA CAGCTGCAGC AGTCAGG 47 
(2! INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
TTATTCGCGG CCGCCTAAAC AGAGGCAGTT CCAGATTTC 39 
(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: MO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
GTCACTTGCG GCCGCCTACA GTGTGGCCTT GTTGGCTTG 39 
(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
TCTGGCGGTG GCGGATCGGA CATCCAGATG ACCCAGTCTC C 
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
TCTGGCGGTG GCGGATCGGA TGTTGTGATG ACTCAGTCTC C 
(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
TCTGGCGGTG GCGGATCGGA AATTGTGTTG ACGCAGTCTC C 
(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 
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(A) DESCRIPTION: /desc = "synthetic DNA" 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TCTGGCGGTG GCGGATCGGA CATCGTGATG ACCCAGTCTC C 
{2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
TCTGGCGGTG GCGGATCGGA AACGACACTC ACGCAGTCTC C 
(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
TCTGGCGGTG GCGGATCGGA AATTGTGCTG ACTCAGTCTC C 
(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 
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(C> STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DMA" 

(iii) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
TTCTCGTGCG GCCGCCTAAC GTTTGATTTC CACCTTGGTC CC 42 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
TTCTCGTGCG GCCGCCTAAC GTTTGATCTC CAGCTTGGTC CC 42 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
TTCTCGTGCG GCCGCCTAAC GTTTGATATC CACTTTGGTC CC 42 
(2) INFORMATION FOR SEQ ID NO: 62: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

TTCTCGTGCG GCCGCCTAAC GTTTGATCTC CACCTTGGTC CC 

(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi> SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
TTCTCGTGCG GCCGCCTAAC GTTTAATCTC CAGTCGTGTC CC 
(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
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TCTGGCGGTG GCGGATCGCA GTCTGTGTTG ACGCAGCCGC C 
41 

(2) INFORMATION FOR SEQ ID NO : 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

TCTGGCGGTG GCGGATCGCA GTCTGCCCTG ACTCAGCCTG C 
41 

(2) INFORMATION FOR SEQ ID NO : 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthesis DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

TCTGGCGGTG GCGGATCGTC CTATGTGCTG ACTCAGCCAC C 
41 

(2) INFORMATION FOR SEQ ID NO: 67: 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D> TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthesis DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 67: 
TCTGGCGGTG GCGGATCGTC TTCTGAGCTG ACTCAGGACC C 41 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthesis DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
TCTGGCGGTG GCGGATCGCA CGTTATACTG ACTCAACCGC C 41 
(2) INFORMATION FOR SEQ ID NO: 69: 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthesis DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
TCTGGCGGTG GCGGATCGCA GGCTGTGCTC ACTCAGCCGT C 41 
(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthesis DNA" 
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(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
TCTGGCGGTG GCGGATCGAA TTTTATGCTG ACTCAGCCCC A 41 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: singie 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc- = "synthetic DMA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



{xi} SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
TTCTCGTGCG GCCGCCTAAC CTAGGACGGT GACCTTGGTC CC 42 
(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
TTCTCGTGCG GCCGCCTAAC CTAGGACGGT CAGCTTGGTC CC 
(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
TTCTCGTGCu GCCGCCTAAC CTAAAACGGT GAGCTGGGTC CC 42 
(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
CGATCCGCCA CCGCCAGA 18 
(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DNA" 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
GTCTCCTCAG GTGGAGGC 
(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 54 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "synthetic DMA" 

(iii) HYPOTHETICAL: NO 

(iv> ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 
CGATCCGCCA CCGCCAGAGC CACCTCCGCC TGAACCGCCT CCACCTGAGG AGAC 
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